Differentiation of segmentally identical expressions occurring in the same or different sentence zones in Hungarian by duration, pitch, intensity and irregular voicing

The paper presents the interpretation and explanation of the findings of two pieces of experimental research within the framework of Varga’s (2016) pitch-tier model of the Hungarian declarative sentence. One of the experiments was established to investigate the information-structural contribution of quantified expressions (such as mindhárom barátom ‘all three of my friends’ and Csaba is ‘Csaba also’). The other experiment explored the acoustic features of the spontaneous-speech specific discourse marker hát ‘well/so’. The two topics can be regarded as interconnected if Varga’s model is interpreted in the strong sense that pitch – presumably in a more or less strong correlation with intensity – is responsible for indicating the topic–comment dichotomy and other factors of the discourse-embedding of sentences. Thus, the reconciliation of our data with Varga’s model requires the consideration of the pitch-tier substructures in their complex dynamism. The experiments support the plausible hypothesis that the variants of the discourse marker hát as part of the preparatory contour primarily differ in duration, while is-quantifiers in different pitch-tier parts differ in terms of pitch values.


INTRODUCTION
The paper presents the findings of two pieces of experimental research which potentially challenge Varga's (2016) model on the topic-comment dimension of the Hungarian declarative sentence. In section 2, we provide a concise discussion of Varga's (2016) proposal concerning the contribution of intonation to the realization of simple declarative sentences in Hungarian. Sections 3 and 4 provide detailed descriptions of two experiments. The first one is a read-aloudexperiment for investigating ten frequent functions of the Hungarian spontaneous-speech specific discourse marker hát 'well/so', distinguishable on the basis of the pragmasemantic toolbox offered by Alberti et al. (2019). The other applies the same read-aloud methodology to the acoustic characterization of quantified expressions such as mindhárom barátom 'all three of my friends' and Csaba is 'Csaba also' used as part of the topic and the comment zone (Szeteli & Alberti 2018b). The conclusion summarizes our findings in light of Varga's (2016) work.
The experiments support the hypothesis that if, and only if, segmentally identical expressions belong to different parts of the pitch-tier model, they differ in pitch, while expressions belonging to the sentence-initial preparatory contour differ in duration. However, the experiment on quantifiers requires a somewhat more complicated model. Varga (2016) provides a detailed analysis of the grammatically and informationally relevant intonational facts of simple Hungarian declarative sentences, concentrating on attitudinally neutral intonational solutions.

VARGA'S (2016) MODEL OF THE TOPIC-COMMENT DIMENSION OF THE HUNGARIAN DECLARATIVE SENTENCE
In his approach, intonation means superimposing certain pitch patterns on the segmental strings of sentences, and thereby producing utterances. The smallest units that are directly relevant to the realization of pitch patterns are not segments (sounds), but syllables, which in turn are composed of segments. The recurring pitch patterns that the syllables of utterances carry are called intonation contours, which Varga (2016) regards as being meaningful and having characteristic shapes, of which the most important ones are (a) the Full Fall, symbol: [ \ ], (b) the Rise [ / ], and the Half Fall [ | ]. 1 Some of the syllables are accented in the following sense: they "are stressed (i.e. they have extra intensity or some other, non-pitch-involving feature that 1 The (plurisyllabic) Full Fall is such that the voice radically drops down between the first and the second syllables. The Full Fall ends on the bottom pitch of the speaker's normal voice range and ends in a pause, while its starting point can be at different heights. The Half Fall is similar in shape to the Full Fall but it does not reach down to the bottom pitch of the speaker and does not end in a pause. The Rise is the name either of a steadily rising contour, or of a contour which keeps level for a large part and then moves upwards at its end. In both cases it can be high or low. Fig. 1 illustrates all these types of contour. It also illustrates a preparatory contour (on the string és akkor 'and then'). The shape of a preparatory contour is similar to a sustained contour (i.e. it can be rising, level, descending) but it lacks the extra prominence on the first syllable of its carrier string. It is realised anywhere in the region between mid low and high. The diagrammatic representation contains filled dots for the accented syllables and short lines for the other syllables, arranged at different heights above a long horizontal line, which represents the bottom pitch of the speaker's voice. Below this line the written text of the utterance can be seen, provided with the tonetic accent marks shown in the paragraph this footnote belongs to.
gives them extra prominence) and, in addition, they are pitch-prominent (i.e. they are associated with a pitch-event, in the sense that they initiate an intonation contour). … From the point of view of intonation it is the accented syllables that play a crucial role" (Varga 2016, 49). Intonation contours appear as melodic constituents within certain phonological structures called Intonation Phrases (IPs). IPs are units of intonation, i.e. containers of connected intonational events, with a characteristic internal structure. The obligatory part of Hungarian IPs is the Terminal Part, which begins on the last (or only) accented syllable. Varga (2016, 75) provides a detailed argument in favour of the topic-comment dichotomy of the Hungarian sentence, which we illustrate in Fig. 1 by means of the most general pattern he provides.
The parentheses in the boxes associated with Fig. 1 mark the optional parts of the general intonation structure of Hungarian declarative sentences. The general structure thus consists of an optional Topic IP, which optionally contains an initial preparatory contour and obligatorily contain a Rise Terminal Contour, and an obligatory Comment IP, the (only) obligatory component of which is its Terminal Contour: a Full Fall, as a definitive part of the declarative sentence type. Both intonation phrases can contain, before their obligatory Terminal Contours, further intonation contours (called 'Scale'), which tend to be Half Falls.
As also presented in Fig. 1, the expressions targeted in the experiments (which constitute the main empirical focus of this paper) are distributed in the general intonation structure as follows. Several variants of the discourse marker hát, the sentence-initial ones, are unaccented inhabitants of the preparatory contours of intonation phrases. The sentence-final hát variants are (not necessarily proper) parts of the terminal Full Fall of the obligatory Comment IP. The quantifiers under investigation are accented components of certain intonation phrases. Varga relies on the traditional view that quantifiers (DistP p ), if any, must appear at the beginning of the  Varga (2016, 52) Acta Linguistica Academica 69 (2022) 2, 163-187 comment, see Table 1; while we intend to point out on the basis of parallel pragmasemantic and phonetic phenomena that they can also function as (non-contrastive) topics. This pitch-tier model presented in Fig. 1 is of high importance for our approach since the expressions we are considering are hosted in four differentiated pitch-tier zones. On the other hand, our approach serves as a valuable contribution to Varga's (2016) model by providing a solid empirical foundation through rigorous statistical analysis. Varga's (2016) model leads to the following hypothesis concerning the role of duration and its division of labor with pitch, at least as a first approximation: if, and only if, segmentally identical expressions belong to different zones of Varga's pitch-tier model, their vocal differentiation is primarily due to their difference in pitch. The inherently unstressed hát-variants in the sentence-initial preparatory contour provide evidence for one direction of the hypothesis: if segmentally identical expressions belong to one and the same Varga-zone, they are correctly predicted to primarily differ in duration (without using pitch characteristics to signal differences). The other direction, according to which segmentally identical quantifier-like expressions in the topic zone and in the comment zone should be distinguishable on the basis of certain pitch values, is only partially corroborated by our analysis (4.1). A simple "static" approach is thus not satisfactory; the two IPS can be safely distinguished on the basis of a global "dynamic" intonational analysis (4.2).

THE HUNGARIAN DISCOURSE MARKER HÁT
The first experiment, a read-aloud experiment of the most frequent Hungarian spontaneous speech specific discourse marker hát 'well/so' will serve as the topic of this section. The experiment was established to extend a former pilot study on the prosodic realization of the discourse marker (Szeteli, Gocsál & Alberti 2019) in multiple ways. On the one hand, it involved a higher number of participants (n 5 53), both females and males (28:25). On the other Table 1. Varga's (2016, 55) proposal for the (linearly ordered) structural positions in positive declarative sentences, on the basis of É. Kiss's (2002) The obligatory comment is preceded by optional structural positions for Topic (T). Within the comment of positive Hungarian sentences we distinguish the following structural positions: Distributive (Dist) position, Positive Degree/ Manner Adverb (PDMA) position, Focus (F) position, Verb Modifier (VMod) position, Verb (V) position, and Postverbal (PostV) position. All of these positions are optional, apart from the V position, which is obligatory in a prototypical Hungarian sentence. The F and the VMod positions are immediately before the V position, but they mutually exclude each other: either or neither of them is present but they cannot both be present before the verb. If the F or the VMod position is present, the constituent in it is accented, and the verb in the V position is accentless. All these positions are summed up in Table 1, where positions that are optional and repeatable are marked with the Kleene star: T p , Dist p , PostV p ; positions that are optional and non-repeatable are in parentheses: (F), (PDNA), (VMod); and the V position, which is obligatory and non-repeatable, has neither a Kleene star nor parentheses.
hand, it can serve as the constructive replication of Dér & Markó's (2017) study in the sense that some of the categories that were added to this extended experiment were created considering the criteria Dér and Markó used by identifying different monofunctional uses of the discourse marker hát in a database of Hungarian spontaneous speech, dubbed BEA (BEszélt nyelvi Adatbázis). We examined hát in ten different contexts (also in different turn and sentence positions) in order to find out which categorization can pinpoint relevant differences in their usages from the perspective of pragmatics. Although hát went through a decategorization process, similarly to other discourse markers, its syntactic features (cf. Szeteli & Alberti 2018a) are worth to be discussed in an exhaustive syntactic theory in accordance with its prosodic realization. Since the usage of the discourse marker hát can only be adequately interpreted in whole utterances or, better, in discourse context, data collection was built around test situations. In this section, the test situations and their pragmasemantic differences are discussed, together with the connection of the discourse marker to the topic-comment dimension of the Hungarian declarative sentence.
In the test, various scenarios were described, at the end of which different alternatives were given as answers to the question presented in the test situation. These answers were designed to represent the alternative functions of hát, which were intended to be investigated. The first subset of situations was the corresponding one of the former pilot study of Szeteli, Gocsál & Alberti (2019). These situations are based on a hypothesis by Alberti (2016) called the "semaphore effect", described in the pragmasemantic framework ℜeALIS . According to that hypothesis, the main difference between the various usages of hát and other discourse markers lies with exactly that semaphore effect, and the particular functions should be separated with respect to that feature. This theory is strongly rooted in discourse representation and its major insight is that discourse markers can signal how easy/difficult it will be for the listener to digest the message at hand. In this regard, the reciprocal attitudes of the speaker and the listener to each other and directed to the propositional content of a given sentence should be considered as well. In our research, we embedded the monofunctional uses in short dialogues between A (read out by the conductor of the experiment) and B (read out by the participant), and we built very explicit contexts around them. The participants had around ten minutes to familiarize themselves with the situations. They were also asked to speak as naturally as possible, reading out every single word in the given word order and playing the role by using a tone that is not monotonous. Elements of punctuation suggestive of tonal features were ommitted.
The story had the following common frame with nine different outcomes (containing ten samples of hát), depending on the answer of character B: A and B are twenty-year-old university students in love with each other, who have been going to the cinema for a year on a weekly basis. They take turns in choosing the movies and they know each other's tastes quite well. This time it is B's turn to decide and this is an excerpt from their dialogue when B has already browsed the cinema program and s/he has made her/his decision, but s/he has not told A yet, who is eagerly waiting for the "announcement of the result". Three movies are competing, namely (i) an English detective story nothing out of the ordinary, (ii) an Icelandic drama which seems depressive and (iii) an American comedy presumably full of dirty jokes.
For one subset of the situations five different (but uniformly) turn-initial positions were examined, out of which four were sentence-initial ones, and one was sentence-final. In the examples shown in (T1-9) 'a.' always gives the context (the thoughts, more formally the intension, of B) to the particular dialogue, 'b.' is the same question of A in every case and the rest of the example is the answer to that question, containing the discourse marker hát with different functions, concerning the given context. 3 The first four situations were built up to express the functions of hát as a semaphore and their pragmasemantic contribution was elaborated in the representationalist framework ℜeALIS by Szeteli (2019) and Szeteli, Gocsál & Alberti (2019). The straightforward answer (which will appear as h1) can be uttered in a situation that is expected for both participants. However, hát in an uncertain answer (h2) is a marker of the difficulty of making a choice: a. B is not sure if s/he has made the best decision. S/he feels that based on the current cinema program there is no decision which could be argued enthusiastically. S/he thinks that their mutual experience is such: In the past, several detective stories proved to be boring but acceptable. A tends to choose comedies, but last week even s/he herself/himself was shocked by the tastelessness of the movie s/he had opted for. Finally, A cannot tolerate dramas. b. A: Na, melyik filmet választottad?
'That more or less works for both of us. Last week even you got utterly fed up with the tasteless comedies.
As for depressive dramas, they don't enthuse even me.
Or are you very tired of watching detective stories?' In our experiment, the anxiety of the speaker due to the listener's contrary desire was considered as a different case than (T2). Szeteli (2019 370) found that in this subtype of uncertainty (uncertainty resulting from the partner's opinion) the discourse marker hát is more often collocated with a filled pause than by the other subtype (T2), although this is only a tendency.
(T3) Uneasy answer (Hátöö -h3) a. B feels that detective stories are getting more and more boring. As for comedies, even A has recognized that they are no longer funny but rather disgusting; but what s/he (A) truly and deeply hates are depressive Nordic dramas. B, however, has heard about this Icelandic drama from a university friend, a philologist. A is jealous of that friend, maybe not without a reason. B finally decides to exercise his/her right but with a deep concern… What if A wants to know who suggested that movie… b. A: Na, melyik filmet választottad? c. B: Hát a drámát B: well/so the drama.ACC B: 'Well… the drama.' d. Tudom hogy nem nagyon rajongsz ezért a műfajért de mintha ezt az izlandi filmet valahol nagyon dicsérték volna Asszem valami díjat is nyert A krimiket már kissé unom a mostanában futó vígjátékok alpáriságából pedig a múltkor már neked is eleged lett úgy emlékszem 'I know that you are not very keen on this kind of movies but this Icelandic one was praised… somewhere… and I also think it received some kind of award. I'm getting a little bored with detective stories and last time even you got fed up with the disgusting comedies as far as I can remember.' Furthermore, the discourse marker hát can express complex attitudes such as, for example, teasing. However, this is only an option, since the characteristic tone of teasing, "waving at length", can be associated with different elements in the sentence (Szeteli 2019 370), so it should rather be described as a separate feature of pragmatics.
(T4) Teasing and Confirmation (HáÁáÁát and …hát! -h4 and "final" hf) a. B feels that detective stories are getting more and more boring but what A truly hates are depressive Nordic dramas. S/He would get depressed, and s/he would probably badger her/him the whole evening that the freaking movie had been recommended by the hot friend from the university, the charming philologist. Therefore, B decides to choose the comedy (it cannot be as tasteless as last week's one), but now s/he (B) can afford to tease her/him (A) in exchange for her generous decision. Her/his (B's) goal is as follows: At first, A should not believe that s/he is prepared to watch a comedy after last week's nightmare. This should make A even happier and the evening will be great. The reason why it is extremely difficult to account for the meaning or function of hát is that it has several different interpretations depending on the mental state of the speaker who utters it. It can indicate strong-mindedness and hesitation, and it can carry complex suprasegmental tones such as teasing, as shown in (T1-T5). Furthermore, there is no consensus on the relevant functions of the discourse marker between researchers. We attempted to involve situations in our read-aloud experiment to find out the relationship between our former categories and the categories of Dér & Markó (2017). Dér & Markó (2017) investigated monofunctional uses of the discourse marker in their corpus-based study and examined distinct turn positions. They identified a turn-initial, a turnmedial and a turn-final position, and they also had a fourth, purely pragmatic category for hát indicating judgment. From these four categories, only the turn-medial hát proved to be significantly longer than the others. The other functions of hát did not differ in their duration characteristics from the straightforward function identified by Alberti (2016), in our experiment. However, h6 affects another level of discourse while introducing a new topic instead of being associated with the propositional content of the sentence. We identified this as the 'metatalk' feature of hát to suggest the concept of the rhetorical relations (cf. Asher & Lascarides 2003 A sentence-final inferring hát was involved since it preserves the inferential core meaning of hát in contrast to hf, which is used for signal confirmation. Their prosodic characteristics highly differ, h9 is a regularly voiced hát whilst hf is voiced irregular in most of the cases. A Chi-square test of independence was carried out to determine if there is an association between creaky voice quality and hát function. The results confirmed a statistically significant association, c 2 (1, N 5 106) 5 20.699, P < 0.001, which means that irregular phonation is a characteristic feature of hf (presumably with a hidden variable in the background). 4  2 shows the duration data of the ten investigated functions and also gives a brief summary of their pragmasemantic contribution. This information is given in the formalism of the representationalist framework ℜeALIS ). The strings (e.g. iBuB þ ) represent possible wordlets of their holder, which is indicated by the first symbol of them (i 5 I (the speaker), u 5 you (the listener), r 5 underspecified holder). The second symbol represents the modality of the possible wordlet, associated with the propositional content of the sentence by the holder of the wordlet (indicated by the first symbol, mentioned above). It can be, for example, a belief (B), desire (D) or intension (I). The modality is followed by an element of the powerset of T 5 {À,þ,0} which indicates the temporal features of the wordlet ('À' for past, 'þ' for future, '0' 4 Note in passing that variant h9 is unstressed, just like h1, …, h8, in contrast to hf. In the relevant example this unstressed hát occurs in the tail of a sentence-final Full Fall, but in other examples it could be in the tail of a scalar (5 sentence medial) Half Fall, too (see Varga 2016, 52), just after the verb: | Kiválasztottam (te)hát a \ vígjátékot. Thus, the most important difference between hf and h9 is that the former is strongly stressed (A | vígjátékot \ hát!), while the latter is unstressed. The former takes the sentence-final Full Fall on itself, the latter is in the tail of the sentence-final Full Fall (see Fig. 1 in Section 1).
for present, in the string usually unmarked) if that string is followed by a new holder, modality and temporal feature, it means that it is embedded into the previous wordlet. A scale was introduced for the values referring to false (À5), true (þ5), underspecified (0) and to various degrees between them (which more or less correspond to Farkas & Roelofsen's (2017) credence levels). The '-s and ˮ-s express our claim that the truth values show a character which can be described by a certain distribution. The values are not specified in every case, so the underlined part of the string can show which part of the worldlet became a specified one due to the given value. Our claim, that h5 is the counterpart of h3 from a representationalist perspective, is now understandable: by h3 the listener was supposed (iBuDþ5"þ5•-5') to reject the movie (uD5-5'), however, in h5 the speaker would reject it (iD5-5') if s/he could. The crucial differences in the length feature are shown in Fig. 2, but the results of the investigation of the pausing after hát (shown in Fig. 3) are also highly consistent with the semaphore effect hypothesis of Alberti (2016). The characteristics in pausing tend to follow the patterns predicted by the pragmasemantic representations of the tested situations. The blue columns all show the data of uttering hát while having a high value on the belief scale and on the scale which says "your listener will know that you have a high value on the belief scale" (iBuB þ 5iBuB þ iB þ ∈ 'þ5), too. However, h1the purest form of that functionis followed by the shortest pausing. It was also mentioned that the uneasy hát is a result of hát and the filled pause after it, which can be regarded as a collocation. Fig. 3 shows that the other types of hát without the negative character in the desire did not collocate with long pauses after them. straightforward (h7, h1, h6, h8, h9): iBuB + =iBuB + iB + ∈ '+5 resigning (h5): iD=-5', iBuB + iD=-5' uneasy (h3): iBuD+ ∈ "+5•-5', iBuB + rD∈''+5, iIuB+iBuD+ ∈ "+5•-5' uncertain (h2): iBuB + =iBuB + iB + ∈ '0' The dialogues and all the data discussed in the paper were recorded with a dictaphone (44,1 kHz/16 bit). The relevant extracts from the recordings were analyzed in PRAAT 6.0.24. The statistical calculations were executed by SPSS 23. Further relevant data are summarized in an appendix.
It would go beyond the scope of the paper to thoroughly discuss either the formal pragmasemantic details summarized in Fig. 3 or the sophisticated statistical apparatus, partly presented in Figs 3 and 2, on the basis of which the phonetic forms of several hát variants can be distinguished from each other. What is relevant now is that it was possible to point out significant differences in temporal characteristics; as for pitch characteristics, however, as indicated by the yellow line in Fig. 4, there are no significant differences among the different sentenceinitial usages. Given that variants h1-h8 occur in the preparatory contour with h9 and hf occurring in the Full Fall (see Fig. 1), this can be explained by the following plausible consequence of the pitch-tier model: if segmentally identical expressions belong to one and the same pitch zone, they will primarily differ in duration (without using pitch characteristics to signal differences) (Fig. 5).

THE TOPIC-COMMENT VARIABILITY OF RELEVANT-SET BASED OPERATORS
The second experiment under discussion was established to investigate the contribution of Hungarian quantified noun phrases to the logical dimension of information structure and the topic-comment dimension. These phrases show a logico-pragmatic relation between their Fig. 3. Pausing values (r i ) after the sentence-initial hát variants (h i , respectively) referred to as h7, …, h4 in Fig. 2 [ms] (bar charts with 95% confidence intervals) Acta Linguistica Academica 69 (2022) 2, 163-187 explicit meaning and a broader implicit set, due to the relevant-set based operators. 7 Furthermore, they indicate the new or correcting information in a topic-comment dimension, which belongs to the broader construction of the discourse. In this section the theoretical background of the experiment will be briefly discussed, concentrating on the topic-comment aspect of the The f0 values were detected at the beginning, at the midpoint and at the end of the given vowels. 7 É. Kiss (2002, 77-78) explains the concept of a relevant set as follows. Semantically, the (identificational) focus is more than merely non-presupposed information; it expresses exhaustive identification from among a set of alternatives. Consider (1a 0 ); this sentence is used in a situation or context in which it can potenitally be true of a set of persons, including Csaba, that they have come. It expresses that, of this set, it is Csaba and no-one else for whom it holds that he has come. The identificational focus thus operates on a set of alternative individuals for whom the VP can potentially hold, exhaustively identifying the subset for which the VP does hold, excluding the complementary subset. The latter two disjoint subsets, whose union is the relevant set, can be called the positive and the negative subset of the relevant set. These concepts can be extended to quantifiers as well. In the case of an each-quantifier (1b), the positive subset coincides with the whole relevant set, with an empty negative set. In the case of an also-quantifier, see also (1b), it is claimed that there is (∃) at least one (implicit) member of the relevant set for whom the VP (also) holds, in addition to the participants explicitly claimed in the given sentence to be such that the VP is true of them. It is due to this existential factor (relative to the entire relevat set) that also-quantifiers are logically distingushed from each-quantifiers by means of the existential/universal dichotomy in (3-4). This differentiation is not in conflict with the fact that, if the corresponding positive set is considered, both types of quantifier in question function in a distributive universal way: even an also-quantifier expresses that the VP holds (separately) for each participant in the positive set.
phrases rather than the logico-pragmatic one (but see Szeteli & Alberti 2018b). Then the set-up and the analyzed data of the experiment will be presented. The noun phrases in question are modified by the universal determiner mind(en) 'every' and the clitic is 'also', thus É. Kiss (2002) calls them quantified noun phrases. In contrast to general Hungarian noun phrases (1a-a 0 ), they are only compatible with the verbal prefixverb stem word order (1b-b'). In this respect, they mirror expressions modified with csak 'only' or nemcsak 'not only' (2a), which onlytolerate the verbal prefixverb stem word order (Szeteli & Alberti 2018b Nevertheless, the quantifiers which never evoke the other possible verbal prefixverb stem order presented in (1b) and (2a), both can have both a topic (3) and a predicate/comment function (4) from a prosodic and pragmasemantic point of view. As mentioned in Section 1, Varga (2016), following the Hungarian generative literature, does not consider (non-contrastive) topic-like quantifiers (3b, d) at all; Szeteli & Alberti (2018b) have pointed out their existence by means of the radically different contexts presented below: it is to be investigated how these expressions work in a broader world of the discourse in a basically Büringian (1997a, b) style (cf. Gécseg 2013Szűcs 2017). Note that it has already been raised in the Hungarian generative literature that eachquantifiers can be characterized by a topic-like behavior (Surányi et al. 2012, 64;Surányi & Madarász 2017, 44-46), besides their comment-internal function as the "orthodox" view. Even Varga (2016) provides a puzzling analysis: The first pitch tier belongs to the construal according to which the intonation structure of the given sentence consists of a single Comment IP, with the quantified expression occupying the default Half Fall intonation contour at the beginning of the comment. The second pitch tier is claimed to have been formed by replacing the Half Fall with a Rise, which is an IP-closing contour, and thereby a structure that consists of two intonation phrases emerges. Instead of claiming that in this case there are two Comment IPs, it is at least as plausible to say that there is an interpretation (with the usual Topic IP and Comment IP) according to which referring to the musicians serves the purpose of collecting the participants of whom it is predicated that they were sent up to the attic. Note that the first pitch tier with a single Comment IP containing two Half Fall intonation contours can be derived from a pitch tier in which the Comment IP consists of a single Full Fall (Varga 2016, 62).
A read-aloud-experiment with the participation of 41 non-linguist native Hungarian subjects was established to investigate the question empirically as to which pre-V part of the sentence each-quantifiers and also-quantifiers belong to, in Hungarian. The hypothesis was that both each-quantifiers and also-quantifiers can also serve as a focus/comment and non-contrastive topic based on a two-dimension operator model. The one dimension captures the topiccomment features, mentioned above, and the other captures how these expressions also divide the logical space considering the implicit set of other participants (cf. Szeteli & Alberti 2018b, 163-166).
The appropriate readings of the eachand also-quantifiers were embedded into text situations, shown in (3) and (4), and their relevant sequences ((3b), (3d) and (4b), (4c)) were cut out and analyzed by Praat. (3) Test situation with quantifier expressions functioning as a topic a. Köszönöm érdeklődő kérdésedet! Úgy jellemezném a barátaimat, hogy Anti imádja a teniszt, Béci szeret sakkozni és pingpongozni, 'Thank you for your inquiring question! I can characterize my friends as follows: Anti likes tennis, Béci gladly plays chess and The participants told the conductor of the experiment that they are prepared for reading out after studying the dialogues for 1 or 2 min. They were asked to speak naturally since the goal of the research was imitating spontaneous speech as truly as possible. The instructions were shown above the test situations, namely (i) reading out every single word in the given word order as precisely as possible (ii) playing the role by using a tone which is not monotonous, considering the given context and (iii) also considering that the punctuation is not completely given (dots and question marks have been used, but every element of punctuation referring to additional tonal features was avoided) to force the participants to read out the sentences following their individual intuition. We summarized some information in a table, relevant to the methodology (see Appendix) of both experiments in Sections 3 and 4. The duration values of the phrases and the pitch values of the vowels of the first syllables were measured to find out if the phrases containing a quantifier were stressed in the comment but less stressed in the topic reading.

A "static" hypothesis
In Fig. 6, the pitch tier model presented in Fig. 1 in Section 1 is repeated with its relation to the tested sentences. On the left-hand side it is shown how the pitch characteristics change during

CSAba
is szeret ping' És MINDhárom barátom odavan Fig. 6. Differences in the pitch characteristics of the first syllables of the words in the phrases containing the is-quantifier (on the left) and the mind-quantifier (on the right) in topic (marked with blue) and comment reading (marked with orange) the sentences containing the is-quantified noun phrases in both readings. On the right-hand side, those features of the other sentences containing a mind-quantifier are visualized. The pitch data on syllable CSA-in Fig. 6 the vowel of the first syllable of the quantified noun phrase is remarkably higher pitched in the comment readingstrenghten our following plausible global hypothesis on the division of labor between pitch and duration in Hungarian declarative sentences. If, and only if, segmentally identical expressions belong to different zones of Varga's pitch-tier model, their acoustic differentiation is primarily due to their difference in pitch.
For the statistical tests also a smaller subgroup (n 5 31) of the participants (n 5 41) was investigated. The excluded outliers did not pronounce all elements of the relevant fragments or the identification of pitch values was difficult due to their irregular voicing (see McGlone 1967;Blomgren et al. 1988cited by Markó 2013). The pairs were tested with Friedman's Two-Way ANOVA combined with the Wilcoxon post hoc test. The differences proved to be significant for the pitch values of the is-quantifier (P 5 0.000 by n 5 41, and also by n 5 31); let us formulate this significant difference in this way: CSA Top << CSA Com .
In the case of the universal quantifier, however, it is not true that the vowel of the first syllable of the quantified noun phrase (MIND-) is significanty higher pitched in the comment reading: the hypothesized [MIND Top << MIND Com ] relation could not be verified. This result is not surprising on the basis of the visual impression provided by the relevant curve in Fig. 6. 8 Theoretical syntacticians' stance according to which the first accented syllable of the comment zone is some kind of peak of the sentence expresses an idealized "functional" situation, 9 in whose formal background there is presumably a complex division of labor among pitch, duration and intensity with a high level of freedom in realization. What Varga's selected pitch tier suggests is also only that there tend to occur no significantly higher pitched syllables than the first syllable of the comment zone. It is somewhat puzzling that 'the bottom pitch of the speaker's normal voice' is considered as a usual benchmark in Varga's (2016) model while there is no analogous 'top pitch' applied. The pitch tier in Fig. 1 suggests that both the first accented syllable of the Topic IP and the first accented syllable of the Comment IP might be identified with a constant top pitch typical of the speaker.
All in all, the facts [CSA Top << CSA Com and MIND Top > MIND Com ] does not support the strong hypothesis that a quantifier is significantly 'more accented' (in the above defined sense) in the Comment IP than in the Topic IP (X Top << X Com ). Only anundoubtedly unsatisfatorily weakhypothesis can be retained: [X Top >> X Com ] never occurs.

A "dynamic" hypothesis
If the accented syllable of a Half or Full Fall intonation contour can be realized on the speaker's top-pitch level in not only the Comment IP but also the Topic IP, then we should give up 8 Moreover, it happens to be true that MIND Top > MIND Com (but [MIND Top >> MIND Com ] is false). The curve in Fig. 6 suggests that the "accident" (what qualifies to be an accident from the point of view of the strong version of our hypithesis) is to be attributed to the sentence-initial connective ÉS, which takes the role of the indication of the targeted relation from the first syllable of the quantifier: ÉS Top < ÉS Com . 9 The given "peak" is claimed to serve by É. Kiss (2002, 11) as "the heaviest grammatical stress in the sentence". The somewhat enigmatic attribute 'grammatical' and the associated footnote 8, according to which "by 'grammatical stress' I mean stress not affected by pragmatic factors," can be regarded as disavowing any responsibility that would be implied by committing to an explicit, measurable version of the interpretation of 'heaviest stress'. attempting to base the pointing out of significant differences between the two intonation phrases on syntacticians' 'heaviest stress' (É. Kiss 2002, 11) impression. What offers a promising alternativea similar one to Genzel et al. (2015: 196) observationis IP-internal dynamism of acoustic values, that is, to seek significant difference in the speed values of falling in Half/Full Fall intonation contours.
First of all, however, the relation between the three types of acoustic features should be clarified. The correlation analysis (executed by SPSS) provides an explanation for the fact that the concept of accent is not defined purely on the basis of voice frequency, that is, pitch; see Figs 2 and 3. We provide the correlation values among the three acoustic features associated with the types of occurrence of the accented syllables CSA (Fig. 2) and MIND (Fig. 3) and the changes in the three acoustic features within their intonation contours. That is, in the latter case, the difference between the corresponding acoustic features was considered as a basis of the calculation of correlation in the case of the pairs 〈CSA, IS〉 and 〈MIND, ROM〉, in order to capture what is referred to as 'speed' above ( Table 2).
As can be seen in Figs 2 and 3, the correlation is uniformly positive (there does not appear even weak negative correlation), in each combination investigated. Thus, higher pitch, greater intensity and longer pronunciationin the case of both the accented syllables and the (two times two) speed values under investigationcome together, pulling each other to the same direction. 10 The correlation-strength values vary from very weak (but still positive) ('0.0') through weak ('0.1') to medium ('0.3'). This means that pitch, intensity and duration in part express other factors (e.g. attitudinally relevant factors or speaker/context-dependent characteristics) than 'accent' (see Section 2) but in part they do express the characteristics of the pitch tier widely assumed to decode syntactic/information-structural information. Listeners thus can detect accent and falling speed on the basis of any of the three acoustic features, or rather, by considering their co-operative activity. It is postponed to future research to detect which Table 2. Correlation between duration (d), pitch (p) and intensity (i) values associated with the syllable CSA and the difference between the corresponding values of CSA and IS (see examples (3-4)) if these syllables are assumed to be in a Topic IP (3b)/Comment IP (4b) '.3 p ' refers to the fact that in the correlation matrix provided by SPSS the corresponding value (e.g., 0.411) is to be evaluated as a significant (95%) medium level Pearson correlation. The asterisk marks the given level of significance. '.1' conveys 'weak correlation'. acoustic variable serves as the leading influence in the background of the other two. As can be seen, the distribution of correlation values in the submatrices of accented syllables CSA (in Fig. 2) and IS (in Fig. 3) does not provide a uniform picture. In the case of the falling-speed values (see the submatrices that belong to CSA-IS and MIND-ROM), however, there are less vague tendencies: there is a prominent co-operation between pitch and intensity values, scarcely followed by duration values. This suggests the following hypothesis: the well-known Hungarian post-focus stress elimination/deaccentuation (É. Kiss 2002, 77;Varga 2016, 61-62) and, generally, the falling speed of Half/Full Fall intonation contours are acoustically marked by highly correlating pitch and intensity values, with negligible influence of duration (Table 3).
The level of significance is always 95%. The difference between relations '<<' and '<<<' is that in the former case the significance of the given difference can only be verified by Loftus & Masson's (2014) complex method while in the latter case the significance of the given difference can directly indicated by confidence intervalls.
With all this in mind, let us consider Table 4 below, in which we provide the results of numerous pairwise comparisons, such ones as those presented in Figs 7 and 8 as an illustration of our comprehensive statistical analysis (Fig. 9).
The data in the upper 3 3 8 subpart of Table 4, and especially the comparison between the columns of CSA and ROM, corroborate the conclusion drawn in 4.1: "static" comparisons between the hypothesized Topic IPs and Comment IPs provide no satisfactory generalizations on their acoustic differences. The falling-speed data in the lower 3 3 8 subpart of the table, however, provides a higly uniform picture if it is accepted that it can be attributed to statistical "noise" (e.g. such differences in context as the cliticization of an ÉS 'and' to one of the quantifiers, 12 besides attitudinal and speaker-dependent differences) that the otherwise highly correlating pitch and intensity values produce significant differences at different points. In the intensity dimension, for instance ÉS Top <<< ÉS Com . It has also been observed by Genzel et al. (2015: 196-197) that it is not always the first element of the comment that provides the highes pitch value.
In the case of the mind-quantifier, it is justied to consider not only the intonation contour of the determiner itself (MIND-ROM) but also the intonation-contour sequence that belongs to the entire quantified noun phrase (MIND-BA, from the constituent mindhárom barátom 'all three of my friends'). The six concerned relations, with three instances of significance, show perfectly consistently that Comment IPs can be characterized by a higher speed of falling in the relevant Fall intonation contours. The same holds for the case of the is-quantifier. In this case, the CSA-IS values provide the acoustic characterization of the entire quantified noun phrase, and they excellently corroborate the 'quicker falling in Comment' hypothesis. The subsequent intonation contour, that of CSA-SZE (from szeret 'like') shows a surprisingly similar character, with the column of CSA-PING showing the radical shift expected earlier on the basis of the fact that szeret and pingpongozni are already VP-internal elements. This delay in shift of the acoustic pattern can be explained as follows: the new piece of information is not 'like' but the 'to play table tennis' choice, relative to the previous conjunction 'to play chess and table tennis'. To sum up Section 4, the crucial difference between Topic and Comment IPs can primarily be pointed out not on the basis of certain acoustic differences between the allegedly 'heaviest stress' in Comment and the peak of the Topic zone but falling speed values in Half/FullFall intonation contours. The former factor is more prone to accidental influences than the latter one.

CONCLUSION
On the basis of our earlier findings, we have formulated the plausible hypothesis that if, and only if, segmentally identical expressions belong to different parts of the pitch-tier model presented in Section 2, their acoustic differentiation is primarily due to their difference in pitch: The hát-variants in the sentence-initial preparatory contour, discussed in Section 3, have been correctly predicted to primarily differ in duration (and not in pitch values). In harmony with the hypothesis, too, is-quantifiers in different pitch-tier parts (Section 4) proved to be distinguishable on the basis of pitch values; namely, the pitch values associated with the (stressed) first syllables of the given quantifier expressions. The acoustic data set which has characterized the mind-quantifiers in our experiments (Section 4), however, suggests that our global hypothesis on the labor between pitch and duration cannot be retained in this simple form. The investigation should be extended to the triplet of pitch, duration and intensity values as well as falling-speed data in Half/Full Fall intonation contours (4.2). Highly correlating pitch and intensity values produce significant differences between certain intonation contours of segmentally identical quantified expressions used as topics/comments.