Journal of Pragmatics 146 (2019) 1e18 Contents lists available at ScienceDirect Journal of Pragmatics journal homepage: www.elsevier.com/locate/pragma Discourse marker sequences: Insights into the serial order of communicative tasks in real-time turn production Alexander Haselow University of Münster, Englisches Seminar, Johannisstr. 12-20, D-48149, Münster, Germany a r t i c l e i n f o a b s t r a c t Article history: Received 5 December 2018 Received in revised form 1 March 2019 Accepted 7 April 2019 This article investigates the linearization order of English discourse markers (DMs) in twoand multipart sequences at the beginning and the end of turns-at-talk in unplanned conversations, e.g. well I mean you know or yeah but actually. Based on statistical measures of the probability of individual DMs to occur first in all possible combinations involving 26 different DMs it will be shown that the order of DMs has a functional motivation and can be interpreted as reflecting the temporal order of communicative tasks that speakers deal with when opening or closing a turn. © 2019 Elsevier B.V. All rights reserved. Keywords: Discourse marker Combinations Turn Conversation Discourse structure 1. Introduction Despite extensive research on the function and uses of discourse markers (DMs) (e.g. Schourup, 1985; Schiffrin, 1987; Lenk, 1998; Jucker and Smith, 1998; Fraser, 1999; Aijmer, 2002; Fischer, 2006; Heine, 2012; Degand et al., 2013), one aspect has not yet been fully understood, namely the factors that motivate the combination and the order of individual DMs in DM sequences involving two or more adjacent elements, e.g. but anyway, well I think or no well but actually, a phenomenon known as clustering (Maschler, 1994), sequencing (Fraser, 2011; Lohmann and Koops, 2016) or simply combining (Fraser, 2015; Pons Bordería, 2018). DMs have been repeatedly shown to frequently co-occur in natural speech in various languages (e.g. Pons Bordería, 2018; Cuenca and Marín, 2009; Dostie, 2013; Cuenca and Crible, 2019); most of these studies discuss the cooccurrence with respect to questions such as the varying degrees of fixation on a cline from simple juxtaposition to the integration of co-occurring DMs under a more holistic combination serving one single function, the scope and functions of DM sequences, distributional patterns, and their treatment in corpus annotation frameworks. As regards the order of English DMs in sequences, Fraser (2011, 2015) focused on the combining of a small subset of DMs encompassing contrastive (e.g. but, however, yet) and implicative (e.g. so, as a result) DMs, exploring which combinations are possible or “acceptable” (e.g. in contrast þ however) and which ones are not (e.g. however þ rather), based on a distinction between “primary” (¼DMs with a general contrastive meaning, e.g. but) and “secondary” (¼DMs with a more specific meaning, e.g. instead) DMs. Specific ordering principles are not identified, but the argumentation implies that primary DMs logically precede secondary DMs, given that the first are relatively unspecific in their meaning while secondary DMs specify the kind of relationship between neighboring units of discourse. Koops and Lohmann (2015) studied the sequencing behavior of the eleven DMs analyzed by Schiffrin (1987), using the Fisher corpus (American English telephone conversations) as their database, and focusing on clause-initial sequences. Based on a statistical measure (a “normalized pointwise mutual E-mail address: [email protected]. https://doi.org/10.1016/j.pragma.2019.04.003 0378-2166/© 2019 Elsevier B.V. All rights reserved. 2 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 information score”) of what they call the “association strength” of each two-part DM combination the authors propose an “optimal sequencing hierarchy for DM sequences” in their set, which is as follows: oh > well> and > so > or > but > because > then > you know > now > I mean (also Lohmann and Koops, 2016). Crible (2017) studied the combination or clustering of different types of DMs with other markers of (dis)fluency, so-called “fluencemes” (e.g. filled and unfilled pauses, repetitions, or truncations), across different registers in a contrastive corpus-based study of spoken English and French. Her study provided insights into the co-occurrence, position and distribution of DMs in combinations with other kinds of fluencemes. Crible finds that the most frequent patterns of combination are various configurations of DMs with filled and unfilled pauses, such as ‘DM þ pause’, of which more than ten specific configurations are attested (e.g. ‘filled pause þ DM’ or ‘unfilled pause þ DM þ filled pause’). Recently, Cuenca and Crible (2019) provided a qualitative analysis of formal and functional features of DM sequences in conversational English, discussing the different degrees of integration of DMs in such sequences, which range from free juxtaposition to fixed composition and several ambiguous and in-between cases. This paper takes a different perspective, looking at DM sequences at the beginning and the end of conversational turns from a more general cognitive and theoretical-linguistic point of view, based on conversational data deriving from spoken English. As regards cognitive aspects, the linear order within DM sequences might reflect the order of communicative tasks to which speakers are cognitively oriented at turn-beginnings and -endings, an idea which I will refer to as the DM Sequencing Hypothesis. This assumption rests upon the fact that DMs express a broad range of procedural meanings (Blakemore, 2002) that provide processing cues for utterance interpretation. With respect to theory building in linguistics, DM sequences provide important empirical evidence relevant for grammatical modeling, given that regularities in the linear order of different functional types of DMs are indicative of functionalegrammatical linearization principles involving linguistic forms that are used as “extra-clausal constituents” (Dik, 1997) outside clause-internal dependency relations. The aim of this article is to offer an empirical investigation of the sequencing behavior of a larger set of turn-initial and -final DMs in spoken English in order to identify general patterns that determine the linear order of DMs, to examine the possible functional motivations behind the linearization principles observed, and to derive general principles that have the potential to converge into a processing-based communicative grammar of structural units produced in real time. The structure is as follows: Section 2 provides a brief description of the approach to DMs and DM sequences taken in this study, Section 3 is devoted to a discussion of turn-beginnings and -endings as communicatively sensitive moments in turn production at which speakers need to deal with a broad range of communicative tasks. The database and method are presented in Section 4, empirical data are provided in Sections 5 and 6 and discussed in Section 7. The article ends with a conclusion and an outlook on further research (Section 8). 2. Discourse markers and discourse marker sequences 2.1. Discourse markers In this study a DM is defined as a lexical expression with particular formal and functional features, which were systematically applied as operational criteria to data deriving from natural speech. Formally, DMs are characterized by syntactic optionality (they can be omitted without affecting grammaticality because they are outside morphosyntactic and semantic dependency relationships holding between clausal constituents), lack of semantic content, and non-truth conditionality (Schourup, 1999). DMs convey procedural rather than lexicaleconceptual meaning (Blakemore, 2002), providing a processing cue on how the unit they accompany is to be understood or interpreted within a given communicative context. Moreover, they exhibit a high degree of conventionalization or grammaticalization (see Crible, 2017). Functionally, DMs situate an utterance within the communicative context in which it is produced in a wider sense, which is congruent with Schiffrin's (1987) description of DMs on five different “planes of talk” (information state, participation framework, ideational structure, action structure, and exchange structure), and Redeker's (2006) definition of discourse operators as expressions that relate a discourse unit to the immediate “discourse context”, whose nature is not restricted to the linguistic context, but also pertaining to relations between nonlinguistic, nonpropositional aspects of discourse (also Diewald, 2013). For instance, DMs may establish relations between action sequences across larger stretches of discourse (see Crible and Cuenca, 2017: 153), as illustrated in (1). (1) 13 14 15 16 17 18 19 20 21 B: C: B: C: B: Caroline are you coming to the pub tonight? no I'm not coming to the pub tonight. [I'm going to Birkbeck.] [why not? ] you SWOT. <laugh> oh yes I know. (.) I have to swot. I'm not clever like you lot. (..) anyway listen so when are we going together then to do this revision. [ICE-GB S1A-090] A. Haselow / Journal of Pragmatics 146 (2019) 1e18 3 As a DM, anyway expresses a shift away from the prior discourse sequence consisting of a teasing action and initiates the beginning of a new interactional sequence, for which B then requests the addressee's attention (listen). The DM so that follows establishes an inferential relationship between prior and upcoming discourse (if C has to swot he will not have time to get together with B and “do the revision”). Such examples show how DMs establish structures in different domains of spoken interaction, e.g. between conversational actions, episodes, or between explicit and implied meanings (see also Hansen, 2006; Gonz alez, 2005; Crible, 2017). As cues for discourse processing, DMs contribute to the development of a coherent mental model of ongoing interaction and discourse by integrating single units of discourse into a unified whole. Given that ‘discourse’, loosely defined as what is said (or written) in a particular communicative context, involves (i) different speakers who need to structure their interaction, (ii) (spoken) text and thus textual relations that require overall coherence in order for emergent discourse to be perceivable as an organized whole, as well as (iii) different minds involved in the processing of emergent discourse that coordinate the processing of all aspects related to interaction, we can distinguish three basic functional domains that form the communicative “matrix” of discourse and within which DMs operate in order to establish structural relations: (i) INTERACTION, (ii) DISCOURSE STRUCTURE, and (iii) COGNITION, as shown in Table 1.1 Note that a single DM may serve a broader set of functions in different contexts (e.g. Fischer, 2000; Crible, 2017; Crible and Cuenca, 2017), but mostly DMs can be related to core domains in which they prototypically operate. Table 1 Functional domains of discourse markers. Domain General Function Examples (i) INTERACTION listen, …isn't it? (ii) DISCOURSE STRUCTURE (iii) COGNITION organizing turn-taking and interaction (e.g. setting up the conditions for successful uptake of an upcoming message) indicating the type of (local, global) relationship between discourse units or between a discourse unit and implied meanings or inferences providing a cue for utterance-interpretation and, anyway, but, so I think, you know The domain INTERACTION (i) includes all functions related to the management of speakereaddressee interaction and the structuration of the periodic change of speaker roles in interaction. DISCOURSE STRUCTURE (ii) refers to the relationship between discourse units, which encompasses ideational (semantic) relations (e.g. contrast, concession), rhetorical relations (meta-comments on the nature of an upcoming unit, e.g. an inference or a reformulation), and topical aspects (e.g. topic shift, lez, 2005; Redeker, 2006; Crible, 2017). The third domain COGNITION (iii) encompasses DM topic resumption) (see also Gonza functions relating to the alignment of the co-participant's cognitive states and thus serves structuration in terms of jointly constructed mental representations of emergent discourse. For instance, DMs may refer to (presumed) shared background knowledge, inferential abilities, or provide a cue as to how a message is to be interpreted and integrated into the common ground (the set of shared knowledge, beliefs and background assumptions, Stalnaker, 2002), e.g. as expressing an opinion rather than a fact (I think). The DM you know, for instance, invites the addressee to recognize the implications and “sequential relevance” of an assertion or an expression (Schourup, 1985: 105), as illustrated in (2). (2) 44 45 46 47 48 49 50 B: A: B: A: B: A: so uhm (..) so (.) are you still on call (.) at work you know¿ (.) no. no. what do you mean on call. you know uhm tran- uhm transplant. transplant call. yes we are […] [ICE-GB S1A-099] As a DM, you know in line 44 indicates that the question implies more than a mere reference to A's current task at work, based on the fact that speaker and addressee share relevant knowledge, thus inviting B to specify or elaborate on his current work. This also holds for the use of you know in line 48, which accompanies a reference that lacks expressive precision (transplant) and marks it as being linked to what is already shared and thus related to additional implications (see Jucker and Smith, 1998: 191e197). Based on these considerations, the term ‘discourse marker’ as it is used here encompasses a broader range of expressions often referred to under specific categorical labels in the literature, such as interjections (e.g. oh), linking adverbials/final particles (e.g. final then or though), general extenders (e.g. and stuff), parentheticals/comment clauses (e.g. I think), or tag 1 A similar distinction is made by Maschler (2009: 117), who categorizes DMs into three classes: textual, interpersonal, and cognitive. 4 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 questions (e.g. isn't it?), all of which fulfil the formal criteria given above and serve the integration of a message into the communicative context in which it is produced. 2.2. Discourse marker sequences ‘DM sequences’ are defined as a combination of two or more non-identical, immediately adjacent DMs (thus excluding discontinuous co-occurrences of two DMs), provided that each DM is also used outside such sequences as an independent form. Sequences of identical markers may functionally differ from the single use of the DM (e.g. oh oh vs. oh or well well vs. well), and thus be more likely to trigger holistic processing under a different, non-compositional idiosyncratic meaning. The problem of compositionality also refers to sequences of DMs that are not identical, given that some DMs occur frequently together and have developed some degree of idiomaticity as a result of their strong mutual association (Cuenca and Marín, 2009; Cuenca and Crible, 2019). For instance, Schourup (2001: 1031) finds that oh well has become “conventionalized” to “indicate resignation”. The difference between more or less idiomatic and thus non-compositional uses of DM sequences and free, serial combinations, is a gradual one and difficult to determine in many cases. Since all of the DMs analyzed in the present study are regularly in use as autonomous lexemes and since all of the combinations allow for compositional readings they will all be treated as representing free, serial configurations. Several proposals have been made as regards the motivation behind DM sequencing, which can be referred to as (i) floorholding, (ii) functional specification, and (iii) functional complementation. The first proposal assumes that DMs often co-occur for no specific function as a device to hold the floor and bridge a gap during a cognitive planning pause, the DMs having little function themselves (Aijmer, 2004: 186). Under this view, sequences such as okay so or well you know would have to be analyzed as largely redundant and merely serving as a communicative strategy to bridge disfluencies during real-time speech processing. (ii) assumes that DMs occur in sequences because one DM specifies the function of the one with which it co-occurs. Examples are usually provided for DMs expressing a contrastive relationship (Oates, 2000; Fraser, 2015): sequences such as but nevertheless are explained as being motivated by combining the vague meaning of but, which indicates some kind of contrast in a general way, with the more specific meaning of nevertheless, which indicates a concessive relation. Under the third proposal (iii) it is assumed that DMs co-occur because the speaker combines different communicative functions (e.g. Cuenca and Marín, 2009), which would account for the large number of cases where the DMs involved clearly have separate functional values as they indicate relations on different planes of talk, e.g. the textual (e.g. but, so) and the interpersonal plane (e.g. you know, actually). Pons Bordería (2018), using a classification of DMs based on textual, interpersonal, and modal functions for the study of DM combinations in spoken Spanish, observed that DMs with textual functions are preferred in such combinations. 3. Discourse markers at turn-beginnings and -endings: functional aspects In the present study the focus will be on DM sequences at turn boundaries, i.e. at the beginning and end of turns, which means that other positions and factors motivating the use of DM sequences are not explicitly dealt with here. This choice is motivated by the fact that at turn boundaries speakers need to deal with a large set of generic communicative tasks, given that these points represent periodically arising transitional points as regards speaker roles, ideas (expressed by different speakers), and actions, and are thus crucial for the development of discourse. Hence, the need for maintaining order at all levels of the communicative system is very high at such points. An example for the preferred uses of DMs and DM sequences in such contexts is given in (3). (3) 186 187 188 A: 189 190 191 192 193 A: B: A: B: B: yeah. well I was only there for a week so it wasn't as if I was going to get anything. no but actually you weren't supposed- you were advised not to drink water in Leningrad because they they have this special bug (.) that's only found there and in Wolverhampton. it's amoeba isn't it? amoebic. amoebic dysentery yeah. but it did happen. Warren Martin got it from our group […] [ICE-GB S1A-014] At turn-beginnings, both speakers deal with different tasks that can be assigned to the three functional domains given in Table 1, such as acknowledging prior talk (yeah), which refers to structuration in the domain of INTERACTION, or various discourse-structuring tasks such as projecting a shift away from what the prior speaker said (no, see also Lee-Goldman, 2011) toward a new focus (but in line 188) that is potentially unexpected for the addressee (actually), or a contrast to what has just been said (but in line 192). At the end of a turn, a different set of communicative tasks becomes relevant, such as eliciting addressee response (isn't it?) or retrospectively marking what has just been said as coinciding with a prior speaker's assertion (yeah), both of which refer to interaction management. The strong tendency for DM sequences to occur at turn-beginnings and -endings has also been observed in other studies (Cuenca and Marín, 2009; Pons Bordería, 2018). A. Haselow / Journal of Pragmatics 146 (2019) 1e18 5 A ‘turn’ is defined here in conversation-analytic terms as a stretch of speech occupied by a single speaker who is recognizably claiming the conversational floor. This definition excludes two cases: (i) in cases of overlaps and interruptions, a turn is only analyzed as such if at least one third of it (measured in words) is produced “in the clear”; (ii) excluded are utterances that serve merely backchanneling functions (McCarthy, 2003; Rühlemann, 2017), i.e. that are produced during or after another speaker's turn and not followed by more talk (e.g. mhm, yeah or that's right), since these are “not construed as full turns, but rather pass up the opportunity to take a turn” (Levinson and Torreira, 2015: 8). They serve a supportive function as “go-on signals” or displays of attention, involvement, comprehension and encouragement, but not as recognizable attempts to adopt the speaker role. However, if a backchannel item is followed by more talk it is included in the present study as part of a turn, as illustrated in (4), where the speaker clearly contributes more content to emergent discourse after an initial backchanneling device (yeah). (4) 39 40 A: B: you could commission prints of yourself. yeah but I can't afford that kind of thing. [ICE-GB S1A-015] In (4), yeah is used as brief reception token expressing a “pro-forma” agreement (Schegloff, 2007) and preparing the ground for upcoming talk after speaker transition. Table 2 Provides an overview of the generic tasks that speakers are faced with at turn-beginnings, which correspond to the three domains discussed above and many of which have been discussed in the literature (e.g. Auer, 1996; Deppermann, 2013; Heritage, 2013; Haselow, 2017: Ch. 4). Table 2 Communicative tasks relevant at turn-beginnings. Domain Communicative tasks INTERACTION DISCOURSE STRUCTURE COGNITION getting/claiming the attention of the addressee dealing with turn-taking issues; indicating entitlement to take the turn responding to prior talk (e.g. signaling uptake, understanding, acknowledgment) indicating the kind of relation to prior talk (ideational, rhetorical, topical) initiating a new conversational action providing interpretive cues for an upcoming message indicating how to integrate an upcoming message into the common ground At turn-beginnings, speakers mainly deal with linking upcoming talk back to prior talk, and with projections, i.e. with the task of pointing into the discursive future in terms of what will follow in upcoming talk and how this is to be processed or interpreted. The preliminary end of a turn is a second crucial moment in ongoing discourse production in so far as it allows speakers to deal with various communicative tasks before potential turn transition (see Table 3). Table 3 Communicative tasks relevant at turn-endings. Domain Communicative tasks INTERACTION DISCOURSE STRUCTURE COGNITION facilitating addressee-response addressing different aspects of the relationship to the addressee (e.g. backchanneling) marking a transition-relevance place and thus legitimizing turn transition retrospective integration of a turn into ongoing discourse (indication of the kind of relation to prior talk) providing a last interpretive cue before potential turn transition interpretive fine-tuning of a message just produced (e.g. in terms of epistemic value, illocutionary force, canceling possible implicatures) The beginning and end of a turn are thus two crucial moments in the linear production of a unit of talk as they “shape the organization of turns-at-talk” (Schegloff, 1996: 54) in a recurrent way. They correlate with DM use since these represent a routinized communicative resource by which speakers deal with the dense network of recurrent generic tasks. As regards DM sequences it appears plausible that in such contexts these tasks follow a particular temporalelogical order, which would explain possible ordering principles underlying the use of DMs in sequences. This assumption derives from the fact that speakers are forced into the temporal emergence of linear structure, i.e. they can deal with only one single task at a time, which requires sequential ordering (which is, of course, highly automated and thus intuitive) that must be based on directionality in the sense that some tasks need to be addressed earlier (basically those that are oriented backward in discourse) than others (e.g. those that are forward-oriented). 6 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 Based on these considerations, the motivation behind DM sequences and the ways in which DMs are ordered in such sequences can be formulated by way of the following hypothesis: Discourse Marker Sequencing Hypothesis The sequential order of discourse markers reflects the temporal logics of communicative tasks to be performed at a particular moment in turn production. This hypothesis forms the background of the empirical study on the linearization order of DMs in sequences presented here. 4. Database and method 4.1. Database The data related to DM sequences are based on the spoken section of the British component of the International Corpus of English (ICE-GB, sections S1AeS2B), which is based on 1,193 speakers who produced 637,682 words. The ICE-GB offers transcripts of spontaneous (unscripted) speech in private and public contexts (e.g. conversations, broadcast discussions, business transactions), and thus comparatively rich material for DM sequences, which are most frequent in unplanned realtime speech produced in interactive contexts. Each single occurrence of a DM sequence had to be inspected manually after a computerized search. Manual inspection was required for two reasons: (i) to separate all instances of DM sequencing occurring at turn-beginnings and -endings from all other contexts in which such sequences often occur and where they may serve other functions (e.g. within turns), and (ii) to eliminate all cases in which the computerized search delivered results that are clearly not interpretable as DM sequences, e.g. when then is used as a time adverb rather than as a DM in a sequence, when because is followed by of (e.g. well because of…) and thus clearly a preposition, or when you know is followed by a complement clause and thus not used as a DM, but as a matrix clause. 4.2. Method: transitional probabilities The relative position of an individual DM x in various possible turn-initial and -final sequences xy involving any other DM y from a set of DMs was measured on the basis of direct transitional probabilities (DTPs, Gregory et al., 1999; Kapatsinski, 2005) between x and all ys. DTP is a probabilistic measure developed for bigrams that rates the likelihood of a given word to be followed by another word, and thus of the association strength between two consecutive words. The measure was applied here to each individual, two-part DM combination in order to determine the likelihood of a given DM to be followed by other DMs. DTP is defined as the frequency of an individual DM sequence xy divided by the frequency of the first DM x in all turninitial or -final sequences (whether as first or as second element), as shown in (5). Thus, DTP as used here does not measure the overall probability of x to be followed by y in the corpus, but only the probability of x to be followed by y in sequences of DMs (bigrams) at the beginning or the end of turns. (5) DTP ¼ FrequenySequence xy FrequencyDM x in all sequences at turnbeginnings or endings For instance, well co-occurs with I think and is the first element in the combination well I think at the beginning of turns 49 times in the ICE-GB. Generally, well occurs 296 times as part of a combination with any of the 15 turn-initial DMs included in the present study (see section 5.1), either as a first or a second element. Thus, the DTP for the combination well I think is 49/296 ¼ .1655, i.e. the probability of well being the first element followed by I think at turn-beginnings is .1655. The reverse order, I think well, is attested only once among the 110 tokens with I think as the first element in all DM sequences, resulting in a DTP of .009. Thus, the order well I think is more likely than I think well, i.e. in this DM combination well has the tendency to precede I think. If we aggregate all bigram DTPs for a single DM in all possible two-part combinations including this DM, we arrive at a general DTP value for an individual DM that indicates its overall probability to be followed or preceded by other DMs. A high overall DTP would thus indicate a high probability for x to precede all ys in sequences, i.e. such DMs are more likely to occur first in DM sequences. All DTP values are distributed over a [0,1] interval, with values close to 1 indicating high probability. Note that in order to reach highest accuracy all DTPs are based on token frequencies rather than on normalized frequencies (especially since some sequences are very rare, and yet indicative of what speakers do). A particular sequence xy may be part of an even longer sequence xyz (e.g. well I think actually) or wxyz (e.g. yeah well I think you know), but in any case the direct transitional probabilities measured for x in all xy combinations would be indicative of x's probability (e.g. that of well) to precede or follow any other DM y. Thus, the DTPs for a single DM in two-part sequences lead to a general DTP value that indicates the probability of a DM x to be followed (or not) by any DM y from the list in Table 4 for sequences of any size. A. Haselow / Journal of Pragmatics 146 (2019) 1e18 7 5. The frequency of discourse markers at turn-beginnings and -endings 5.1. Turn-initial discourse markers Table 4 shows the DMs that were inspected for their use in sequences, indicating the token frequency at turn-beginnings and their overall token frequency in the corpus. All in all 16 DMs were included as these were the DMs that occurred frequently enough to be part of turn-initial DM sequences at least with some regularity in the ICE-GB and with occurrences that were frequent enough for significance tests. Table 4 Frequency of turn-initial DMs in the ICE-GB (spoken sections). 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. Form Frequency turn-initial Overall frequency yeah yes so oh well I mean no but I think OK because and anyway then actually you know 2,520 2,149 1,848 1,596 1,518 1,013 701 567 542 452 446 322 134 109 98 49 2,746 2,531 3,474 1,623 2,955 1,409 2,472 4,134 1,022 558 1,535 12,586 269 1,629 956 1,005 The low figures for some DMs such as and and but in Table 4 may be surprising, given their high overall frequency in speech. However, at turn-beginnings they tend to be less frequent than within turns, which suggests that they are more frequently used to link different turn-constructional units into a coherent turn rather than to initiate a new turn. An example for turn-initial uses of but in sequences is given in (6), where it is used as a device to reshift the focus of the conversation. The talk is a classroom conversation at UCL. B is a student asking the instructor A whether there is a possibility to attend a tutorial in the upcoming term. (6) 018 019 020 A: B: A: you know I think it's not going to be possible actually to to do that next term unfortunately. but I mean if I if I need some help could I still come? yeah but I don't think there'll be formal tutorials. [ICE-GB S1B-015] B uses but to indicate a move away from A's assertion that attenting a tutoral will not be possible, given that B is interested in getting to know whether he might ask for some help in general. After the production of but he adds a second DM (I mean), which indicates that he is about to specify his point. After weakly confirming B's request (yeah), A indicates a mild contrast to a possible implicature in B's turn, which is that the “help” requested might go beyond occasional, specific questions and thus require the formal setting of a tutorial, which cannot be offered. And is usually used to indicate that the upcoming turn continues a prior action or topic, as shown in (7), i.e. it serves as a functionally generalized device to signal continuation in discourse, e.g. the continuation of a story after a brief intervention by the co-participant. It is often combined with so, which indicates a conclusion or result. In (7), A tells B how she and her friend Leo each invented a language that the other one cannot fully understand. (7) 201 202 198 199 200 201 B: A: B: A: B: A: how do you know how to write him. uhm (.) well I've learnt it. from him. yeah. right. and uhm so I've I've written a few short texts in it like the Lord's prayer I've translated. [ICE-GB S1A-015] A second speaker may also use and to continue or add to a prior speaker's utterance, as in (8), where speaker A combines and with so, which in this case signals that the turn expresses an inference drawn from B's utterance that requires confirmation. 8 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 (8) 23 24 25 B: A: B: and it it faces south and it has big rooms and it's a nice house (.) and, and so the sun comes in. oh yes (..) yes mm. [ICE-GB S1A-031] The figures in Table 4 suggest that the most frequent turn-beginning devices are INTERACTION markers with a transitional function (yeah, yes, no, oh) and markers operating in the domain of DISCOURSE STRUCTURE, indicating a particular kind of progression in discourse, e.g. a continuation or a conclusion (and, so). 5.2. Turn-final discourse markers The forms available for use at the potential end of a turn as attested in the ICE-GB can be categorized into six formal categories: (i) Final particles (FPs) (Haselow, 2012, 2013; Hancil et al., 2015): actually, anyway, then, though (only in final position, where they serve functions that are clearly distinct from their uses in other positions); (ii) General Extenders (GEs) (Overstreet, 1999, 2014; Pichler and Levey, 2011): and stuff, and that, and everything, and so on, or whatever, or something, and all, and all…that sort of thing/…that nonsense/…sorts/…that stuff/…this/ …that); (iii) If-chunks (Brinton, 2008: 163e166): if you like, if I may, if I may say so, if you want, if you don't mind me saying, if you know what I mean; , 2014): I think, you know; (iv) Parentheticals (Dehe (v) Tag questions (Tottie and Hoffmann, 2006, 2009): based on the positive and negative forms of be, can, could, do, have, may, should, and would plus a personal pronoun (e.g. isn't it?, shouldn't he?); (vi) Strengtheners (Haselow, 2017: 183e184): yeah. Table 5 shows the frequency of turn-final DMs from the six categories in the spoken sections of the ICE-GB. Note that for FPs the frequency of all single forms is given since, in contrast to individual forms in all other categories, they are frequent enough to be statistically relevant. Table 5 Frequency of turn-final DMs in the ICE-GB (spoken sections). 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. Tags GEs then though anyway actually I think you know if-chunks yeah Frequency turn-final Overall frequency 405 302 191 162 135 144 95 53 35 23 447 316 1,629 319 269 956 1,022 1,005 35 2,746 An example for DM sequences at turn-endings is (9), where speaker C combines a GE with a tag question. (9) 40 41 42 C: B: well when it came when he came back from America just didn't seem as as though he was, (..) I mean he's got he's got a lot of problems at work and with the gypsies and everything hasn't he. yeah. [ICE-GB S1A-049] The GE and everything marks the turn as providing incomplete, potentially expandible information on the “problems at work”, where “with the gypsies” is only one example from a larger set. The tag question hasn't he serves as a device to elicit a response by the addressee, who is involved into ongoing talk by being invited to contribute. A further example is (10). (10) 174 175 176 A: B: A: Vicky's not awfully good on time. we should maybe just leave a message here saying head over. (.) she won't bother coming then though. [ICE-GB S1A 039] A. Haselow / Journal of Pragmatics 146 (2019) 1e18 9 Here, speaker A combines two FPs, then indicating that the prior message is inferentially related to B's utterance, though indicating that the content expressed in the turn goes against the implication of B's proposal, which is that leaving a note and thus sanctioning Vicky's behavior would change anything. 6. Discourse marker sequences at turn-beginnings and -endings: empirical findings 6.1. Discourse marker sequences at turn-beginnings Table 6 shows the token frequency of each DM combination (upper figures) and the DTPs (lower figures). The table reads as follows: the DMs in the left column are the starting point for combinations with DMs from the horizontal axis, i.e. they form the first DMs x in sequences xy involving DMs of type y from the horizontal axis with the frequencies and DTPs indicated. When the data are read from the top to the bottom, the figures indicate the use of each of the DMs in the uppermost row as a second DM y in a sequence xy. Note that the overall frequency of each DM in Table 6 is much lower than its overall frequency at turn-beginnings (see Table 4), given that all DMs also occur e often much more frequently e as single DMs in this position, and thus outside sequences. Since DTP considers only the frequency of a DM in sequences at turn-beginnings, we are able to level the considerable differences in overall frequency between the different DMs to some extent. This, in turn, accounts more realistically for the psychological effects of DM clustering: combinability and ordering principles should, in principle, be unrelated to the overall frequency of individual DMs in the corpus, but related to the degree to which particular DMs tend to co-occur with other DMs relative to their overall occurrence in combinations. The sums in the rightmost column are aggregated DTPs and thus indicate the probability of a DM x to be used in all xy P sequences as the first element (rightmost column: ‘ [1st]’), that is, to precede all other DMs. The sums in the bottom row P indicate the aggregated DTPs for x not being the first element in a DM sequence (bottom row: ‘ [s1st]’) in the data. The respective cells also show the ratio of occurrences in first (right column)/not first position (bottom row) and the overall occurrence of the respective DM in sequences at turn-beginnings. The data indicate that some DMs have a stronger mutual attraction than others as they enter into combinations more frequently. Some DMs appear to be less combination-friendly (e.g. you know, which co-occurred only with and) than others, and some are much more frequent as first elements (e.g. oh, well) than others (e.g. I think). Almost all two-part combinations show a bias toward a specific serial order as one order (e.g. xy) statistically always predominates over another (e.g. yx). For example, when well and actually are combined, well is only attested as the first element (11 tokens, DTP ¼ .0371), whereas the reverse order (actually well) is unattested. The same holds e.g. for the combination of but and I mean: but I mean is attested with 21 tokens (DTP ¼ .1372), whereas I mean but did not occur in the data. This suggests that there are strong preferences for some DMs to precede and for others to follow the DMs with which they are combined, an observation that is congruent with the findings by Lohmann and Koops (2016) for utterance-initial DMs in a corpus of telephone conversations in English, who state that “it is only rarely the case that a certain ordering and its reversal are both statistically associated combinations” (439). The aggregated probabilities of a DM x to occur as the first element in all possible combinations considered here allow us to rank all DMs according to their likelihood of preceding other DMs in two- or multiple-part combinations. In other words, DMs with a higher aggregated DTP are more likely to be followed by other DMs, and thus more often occurring as the first element in DM sequences than those with lower DTPs. Fig. 1 indicates the aggregated DTPs for each DM as the first element in a turn-initial sequence, showing a positional hierarchy that reflects the probability of any given DM to occur first in a sequence of DMs relative to all other DMs. The asterisks indicate the significance level deriving from a chi-square test, based on observed and expected position-based frequencies in turn-initial sequences.2 Oh has the highest aggregated DTP value (.9593), which means that it occurs almost exclusively as the first element in a sequence of DMs. At the opposite (lower) end we find then, which never occurs as the first element in a sequence of DMs at turn-beginnings. These extreme ends correspond to Lohmann and Koops' (2016) observations: in their ‘network model’, which reflects the co-occurrence and directionality of DMs in a chosen set of two-part sequences, oh always prefers initial position whereas then has the strongest tendency to occur in second position in collocating sequences. The data also show strong similarities to the “optimal sequencing hierarchy for DM sequences” in Koops and Lohmann's (2015) set since those DMs that rank highest, thus having a high probability to precede other DMs, are oh and well, followed by and > so > [or >] but > because, and then and you know forming the lower end. However, given that the authors operate with a smaller set of DMs, further comparisons cannot be made. A look at the upper end of the scale suffices to see that those DMs with the highest probability to occur first serve functions in the INTERACTION domain as they structure the transition from one speaker to another. These DMs are oh, well, OK, yeah, and yes, which express an immediate reaction to prior talk, such as acknowledgement and receipt of information (yeah, okay, yes), change of knowledge state (oh), or that the response options made relevant by a prior turn are inadequate (well, Schiffrin, 2 * ¼ p < .05, ** ¼ p < .01, *** ¼ p < .001. No indication of the significance level is given in those cases where the frequency in one or more of the cells is too low to yield statistically reliable values (which, however, does not necessarily mean that the relation between DTP and relative position in a sequence is insignificant). 10 Table 6 Frequency of DMs in turn-initial sequences and DTPs. 2nd Y 1st / actually actually anyway because but I mean I think no oh OK so then well yeah yes you know 0 0 0 0 0 0 0 0 0 0 0 0 0 1 .0196 0 1 .0196 0 7 .1372 0 2 .0645 1 .0196 0 0 2 .0392 1 .0666 0 1 .0322 1 .0196 0 0 6 .1176 1 .0666 0 0 0 3 .0588 2 .1333 0 2 .0645 0 0 0 1 .0322 3 .0588 0 0 0 0 0 0 0 1 .0088 0 1 .0088 0 1 .0065 3 .0265 0 0 2 .0134 3 .0125 4 .0754 1 .0067 39 .1625 0 4 .0261 6 .0530 1 .009 1 .0067 3 .0125 0 1 .0175 0 3 .0526 0 32 .1081 9 .0388 11 .0371 1 .0043 0 and 0 anyway 0 0 because 0 0 0 but 0 I mean 3 .0196 0 0 6 .0392 0 I think 0 0 0 no 1 .0067 5 .0208 0 2 .0134 0 0 oh OK so then well yeah yes you know ∑ (≠1st) 3 .0526 0 11 .0371 1 .0043 1 .0056 0 25:31 .8064 1 .0188 0 0 1 .0033 9 .0388 12 .067 1 .0303 26:51 .5098 0 0 2 .0177 0 21 .1372 0 0 1 .009 8 .0537 2 .0083 0 7 .0469 7 .0291 0 4 .0701 0 0 7 .0469 0 0 0 2 .035 0 0 28 .1879 7 .0291 6 .1132 0 0 0 5 .0877 0 0 7 .0236 2 .0086 3 .0167 0 3 .0101 36 .1551 22 .1229 0 27 .0912 13 .0560 8 .0446 0 10:15 .6666 21:25 .8400 102:153 .6666 90:113 .7964 2 .0067 0 0 3 .12 7 .0457 8 .0708 1 .0666 0 2 .0130 1 .0088 0 0 1 .0067 45 .1875 0 6 .025 0 0 0 0 0 0 0 0 49 .1655 5 .0216 12 .067 0 37 .125 0 0 0 2 .0086 0 2 .0067 10 .0431 0 0 0 0 3 .0101 8 .0345 9 .0502 0 106:110 .9636 88:149 .5906 4:240 .0166 20:53 .3773 37:57 .6491 1 .04 5 .0327 0 0 0 0 12 .2264 1 .0175 0 0 1 .0065 0 1 .009 3 .0201 47 .1958 9 .1698 0 0 0 0 1 .0065 1 .0088 1 .009 0 72 .3 1 .0188 1 .0175 0 17 .0574 0 24 .1034 9 .0502 0 15 .0837 0 0 26:26 1.0 94:296 .3175 112:232 .4827 88:179 .4916 0 32:33 .9696 P (1st) 6:31 .1935 25:51 .4901 5:15 .3333 4:25 .1600 51:153 .3333 23:113 .2035 4:110 .0363 61:149 .4093 236:240 .9833 33:53 .6226 20:57 .3508 0 0 202:296 .6824 120:232 .5172 91:179 .5083 1:33 .0303 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 and A. Haselow / Journal of Pragmatics 146 (2019) 1e18 11 Fig. 1. Aggregated DTPs showing the probability of turn-initial DMs to be the first element in a DM sequence. 1987: 103), i.e. when the speaker cannot fully endorse the content or implied meanings conveyed in the preceding turn. All these DMs are primarily backward-oriented in the sense that they indicate how the speaker relates to prior talk, e.g. weakly or strongly dis/agreeing or acknowledging receipt of information, before continuing with a conversational “next”. The next group of DMs serve the organization of discourse as they integrate turns into a coherent whole, i.e. they operate in the domain of DISCOURSE STRUCTURE. This holds for most forms referred to a “discourse markers” in the literature (Schiffrin, 1987; Fraser, 1999) and for the lexemes and, actually, anyway, because, but, I mean, no and so in the present study. These DMs are oriented toward an upcoming, new move in ongoing discourse, e.g. a topical shift or contrast (but), a conclusion (so), a move into a different topical direction (no), or the continuation with a new aspect (and).3 In their function of signaling a new move in discourse, they indicate how an upcoming turn is related to the prior turn or a non-verbalized, inferrable meaning deriving from prior talk, marking a boundary between both. The two DMs on the lower end of the scale, I think and you know, indicate how the upcoming turn is to be interpreted, namely as expressing the speaker's epistemic stance when prospectively marking a turn as conveying an opinion rather than a fact (I think) or as inviting the addressee to recognize both the relevance and the implications of the upcoming turn in a given sequential context (you know), i.e. as referring to meta-informational status. These functions clearly belong to the domain labeled COGNITION in Section 2. Based on these observations, the hierarchical organization of DMs according to their tendency to occur first in a sequence shown in Fig. 1 appears to correspond to a particular serialization order of the conversational functions they serve at the beginning of a turn. Table 7 shows the distribution of turn-initial DMs according to their functional domain and ranked according to their DTPs in decreasing order. Table 7 The serial order of turn-initial DMs. INTERACTION > DISCOURSE STRUCTURE > COGNITION backward-orientation forward-orientation forward-orientation 1. 2. 3. 4. 5. 6. and 7. no 8. so 9. but 10. anyway 11. I mean 12. actually 13. because 14. I think 15. you know 16. *then oh well OK yeah yes The model predicts that when several communicative tasks are given expression to at the beginning of a turn, the linearization order corresponds to a particular temporal logics: interaction management (turn-taking, transition from one speaker to another) is logically more relevant at a point in turn production that is closest to the prior turn, namely immediately adjacent, and thus having priority over signaling the kind of continuation, given that backward-orientation to prior talk should occur before going on with any kind of conversational ‘next’. Indicating the type of continuation in discourse (e.g. with a conclusion) relative to the preceding talk, in turn, is more relevant before providing information on how the turn itself 3 Note that no may serve as a DM to express immediate disagreement, but that in DM sequences it is almost exclusively used as a device to indicate a shift on the textual level from what has been expressed or implied before into a different direction, thus serving discourse structure (see example 3). 12 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 is to be understood (e.g. as expressing an opinion, as with I think): the first indicates how the speaker will continue, thus framing the upcoming turn in terms of discourse contingency, whereas the latter provides an interpretive cue as to the character of the turn itself, fine-tuning the addressee's orientation in preparation of the message. There is one clear exception to what looks like a fairly robust tendency, namely the distribution of the DM then. The DTP for then, which clearly has a text-linking function as it creates a relation between two turns based on logicaleargumentative sequentiality (Schiffrin, 1987: Ch. 8), is lower than that of all other DMs operating in the domain of DISCOURSE STRUCTURE and even lower than that of DMs serving functions in the domain of COGNITION, thus following all DMs in DM sequences (e.g. OK then, but then). A possible reason is that the occurrence of then as a DM is very low and perhaps bound to cooccurring DMs in order to distinguish it from the uses as a time adverb, which structures events on a time scale. Generally, the model in Table 7 predicts that those DMs that rank higher (due to higher DTPs) within a particular functional category statistically precede those that rank lower (resulting in sequences such as oh well yeah or but anyway). Note that in this model each of the three functional slots is optional and thus depending on the actual communicativeesequential context in which the production of an upcoming turn is embedded. It does therefore not predict or imply that any turn in spoken English is preceded by three DMs, or by any DM at all. However, it predicts what a DM sequence is most likely to look like when speakers produce at least two DMs in a series. Also note that the model does not indicate what the longest sequence of DMs may be as this depends on the kinds of communicative tasks a speaker wishes to give expression to at the beginning of a turn. A weakness of the model is that it makes predictions on combinations that are not or only rarely attested, e.g. OK you know. This is an effect deriving from the method of ranking DMs in terms of their aggregated DTPs, which generalize over all possible sequences and do not exclude unattested sequences (as these are simply corresponding to a zero value). Likewise, the schema does not account for some patterns that do occur in speech even though they deviate somewhat from the order that derives from the DTPs, such as no I mean so actually you know (ICE-GB S1A-029, 158-159). This is natural, given that the ranking order is based on probability measurements, i.e. it indicates a linearization order that is most probable, based on the combination of the 16 DMs. Probabilities only express what is most likely rather than representing fixed rules that exclude other linearization patterns. 6.2. Discourse marker sequences at the end of turns While turn-beginnings have received much attention in the literature as regards the use of DMs, turn-endings have only recently been subjected to a systematic investigation (e.g. Haselow, 2016, 2017: Ch. 4; Traugott, 2016; Degand and van Bergen, 2018). The study of DM sequences and possible ordering principles of DMs at the end of turns is still untilled soil. Table 8 indicates the frequency and the DTPs of each single DM combination at the end of turns attested in the ICE-GB, based on the set of DMs presented in Section 5. Table 8 Frequency of DMs in turn-final sequences and DTPs. 2nd Y 1st / then then though 0 anyway 0 actually though anyway actually tags GEs I think if-chunks yeah you know 1 .05 0 0 0 0 0 3 .15 0 0 0 7 .35 0 0 0 9 .45 13 .8125 2 .4 0 0 0 0 1 .3333 0 0 0 0 0 1 .3333 0 1 .0434 0 1 .0434 0 3 .1304 0 0 0 0 1 .2 0 0 0 tags 0 0 0 0 GEs 0 0 0 0 I think 0 0 0 ifchunks yeah 0 0 0 you know ∑ (≠1st) 2 .4 0 0 0 0 1 .2 0 9 .3913 2 .4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0:20 0 2:16 .125 0:5 0 1:3 .3333 35:35 1.0 9:23 .3913 2:5 .4 0:0 0 4:4 1.0 0 1 .0625 0 0 P (1st) 20:20 1.0 14:16 .875 5:5 1.0 2:3 .6666 0:35 0 14:23 .6086 3:5 .6000 0:0 0 0:4 0 0:5 0 5:5 1.0 The data allow for the following conclusions. First, DM sequences are relatively rare at turn-endings, i.e. speakers are less likely to combine different DMs at the end of a turn, compared to turn-beginnings. One group of DMs e if-chunks e is even entirely unattested in sequences, i.e. if-chunks always occurred on their own in the ICE-GB. Secondly, some types of DMs A. Haselow / Journal of Pragmatics 146 (2019) 1e18 13 appear to close a turn for further DM production as they are not followed by any other DM (at least they are attested as turnfinal elements in the ICE-GB), namely tag questions, you know and yeah, whereas other DMs are often followed by further DMs, such as FPs and GEs. Fig. 2 shows the order of single DMs according to their likelihood of being followed by a further DM in descending order. Final then and anyway are most likely to be followed by other DMs if they occur in a sequence, given that their likelihood of being the first element in a sequence at turn-endings is highest, whereas e.g. tag questions are not followed by other DMs (hence their probability to be the first element in a sequence is zero). Note that if-chunks were excluded as they were not attested in turn-final sequences. Fig. 2. Aggregated DTPs showing the probability of turn-final DMs to be the first element in a DM sequence. Considering the functional domains within which the different DMs operate we can identify the following pattern. Final then, anyway, though, and actually clearly serve the organization of discourse as they have a retrospective connecting function in the sense that they establish a relation between two adjacent turns. This relational function has been defined as the most important class-defining functional feature of FPs in the literature (Haselow, 2012, 2013; Hancil et al., 2015), which are also referred to as “linking adverbials” (Biber et al., 1999: 889e892): FPs link the turn they accompany to an aspect of the preceding turn and mark the first as being motivated by the latter, hence they operate in the domain of DISCOURSE STRUCTURE. Final then, for instance, typically marks a turn as diverging from the speaker's expectations and as expressing an inference drawn from a preceding turn or discourse segment, as shown in (11). (11) 08 09 10 11 B: A: B: my grandmother was really uhm (.) helpful. who wasyou told her all about it then. yeah I di:d. [ICE-GB S1A-049] The information provided by speaker B leads A to infer that the state of affairs expressed in the turn (“you told her all about it”) is likely to be true. A thus treats B's utterance as a conditional protasis whose truth-value is, however, not hypothetical, but taken as a premise since B said it (‘if, as you say, p, q then?’). Final then establishes a conditionaleinferential link to the preceding turn (Haselow, 2012: 402e404). A second functional class comprises DMs that indicate how the turn just produced, or an aspect of it, is to be interpreted e not in relation to preceding talk, but as to the kind of utterance is represents. Their functional domain is COGNITION. This class includes GEs, the parenthetical I think, and you know. GEs may be used with two functions that can be labeled “referential” and “pragmatic” (Overstreet, 2014). In referential function, they serve as an interpretive cue expressing that there is potentially more to say, i.e. the speaker refers to a set of extra-linguistic entities denoted by the antecedent/s, typically “marking the preceding element as a member of a set” and instructing the listener to interpret this element “as an illustrative example of some more general case” (Dines, 1980: 22e23). Cognitively, they evoke a mental abstraction process leading to a , 2018). The “pragmatic” meanings of GEs are that of a contextually relevant ad hoc category (Barsalou, 1983; Mauri and Sanso hedge on informativeness and that of a “turn-punctor” (Cheshire, 2007; Overstreet, 1999: 103e104, 2014: 112): GEs indicate that a higher degree of precision is not required or intended at a particular point in discourse, and they signal that the turn is to interpreted as potentially complete in spite of informational incompleteness (Overstreet, 2014: 20). I think retrospectively marks an utterance as epistemically weak (Thompson and Mulac, 1991), usually as an opinion, and thus retrospectively changes the interpretation mode of an utterance e.g. from a fact to an opinion or an assumption. The DM is thus used as a hedging device, weakening or mitigating the force of a claim or a comment in such a way that it “leaves room for intervention by the interaction partner” (Nuyts, 2001: 165). The DM you know serves as a device to express the speaker's assumption that, based on shared knowledge or experience, his/her line of thinking or expressing a state of affairs is comprehensible for the addressee, that the latter is able to grasp the implications of a message just produced, and that there is no significant discrepancy between the mental world of the speaker and that of the addressee (Schourup, 1985: 102). 14 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 A third functional category comprises DMs that relate a turn to different aspects of the interaction between the speaker and the addressee. Tag questions facilitate addressee responses in that they offer the addressee to engage in ongoing interaction by eliciting a minimal affirmative response (e.g. hmm, yeah), that is, they invite the addressee to contribute to the discourse (Holmes, 1983; Tottie and Hoffmann, 2009: 146). Other tags are “attitudinal” (Tottie and Hoffmann, 2009: 145) in that they indicate the speaker's attitude, such as disapproval or solidarity, and they may be pragmatically challenging, softening, hortatory, or emphatic (e.g. I've told you not to touch it haven't I?). In both uses, the speaker implies a particular expectation on the side of the speaker, namely that the addressee will agree. Final yeah reconfirms the content of the prior speaker's utterance or an implied meaning, thus providing support for another speaker's utterance and marking a transitionrelevance place. In both cases, the DMs clearly target the addressee and prepare a next step in conversational interaction, potentially opening the floor for next-speaker contribution. If we rank-order the single DMs based on the DTPs in Table 8 and consider their functional value, we arrive at the linearization order shown in Table 9. Table 9 The serial order of turn-final DMs. DISCOURSE STRUCTURE > COGNITION > INTERACTION backward orientation backward-orientation forward-orientation 1. 2. 3. 4. 5. GEs 6. I think 7. you know 8. yeah 9. tag questions anyway then though actually This linearization pattern can, again, be explained with the temporal logics of communicative tasks to be performed at the moment in which they are produced. Discourse-structuring DMs should logically be (and are) closest to the current turn, and thus occur first in a sequence of DMs as they do not prepare a conversational “next”, but relate the turn to prior discourse. They provide a retrospective processing cue helping the addressee to integrate the turn into a mental model of discourse. Once the message has been allocated in ongoing discourse as being related to a preceding turn in a particular way, tasks relating to turn-interpretation or the interpretation of a part of it may e and often do e become relevant, which involves DMs operating the domain of COGNITION. Finally, speakers deal with tasks relating to a conversational ‘next’, i.e. which prepare an upcoming interactional step. Such interactional DMs are usually not followed by other DMs at turn-endings as they serve as the strongest indicators of a transitionrelevance place and are thus maximally relevant for interaction management, more precisely the distribution of speaker roles. 7. Discussion The analysis of the 26 DMs has shown that there is a strong bias toward specific orders when DMs occur in sequences at turnbeginnings and -endings. This suggests that turn-beginnings and -endings as “extra-clausal” parts of an utterance have a syntax of their own, which provides evidence for the existence of a component of grammar that encompasses forms that are not involved in clause-internal dependency relationships and that serve the structuration of language on a macrolevel. Such a grammar could be referred to as macrogrammar (Haselow, 2016, 2017) and be described in terms of linear (rather than linearehierarchical) ordering principles, based on the time course of communicative tasks to be performed in real-time speech production. The sequentiality of such tasks predicts that any given DM from a particular functional domain serving task x precedes a DM from a functional domain serving task y if giving expression to x is logically more relevant before expressing y. For the 26 DMs analyzed above, the temporalelinear order of communicative tasks at turn-beginnings and -endings that accounts for the highest probability of DM ordering is as shown in Fig. 3. The direction of the arrows indicates the main orientation of the respective tasks, which are either oriented backward (to a prior turn or a message just produced) or forward (to an upcoming turn or a next speaker). Fig. 3. Sequentiality of communicative tasks underlying the linear order of DMs. The order reflects a functional motivation underlying a speaker's cognitive orientation in discourse processing, and thus lends support to the DM Sequencing Hypothesis: the extreme ends of a turn (i.e. the very initial and very final slot) tend to serve the production of DMs that display orientation to the preceding or upcoming speaker and serve functions in the domain of INTERACTION. The slots following (at turn-beginnings) or preceding (at turn-endings) these DMs functionally serve tasks oriented toward the turn itself, namely its integration in surrounding discourse (within the functional domain of DISCOURSE STRUCTURE) and its interpretation, e.g. in terms of epistemic certainty (¼COGNITION). In this sense, the order of DMs within A. Haselow / Journal of Pragmatics 146 (2019) 1e18 15 sequences at turn-beginnings and -endings is, in some way, analogous: tasks related to another speaker's turn or participation are dealt with in the transition zones from one turn to another and from one speaker to another whereas tasks referring to the turn itself are closer to the message expressed in a turn if the full spectrum of different DMs is used. However, note that discourse (textual) organization follows interaction-related activities at the beginning of turns, but precedes all other functions at the end of turns, where the DMs serving this function occur first and thus closest to the turn just produced. A possible explanation is that since textual linking has an important framing function as it situates a turn in discourse-so-far, it occurs earliest (after backwardoriented tasks) when an upcoming turn is supposed to be established in order to provide an early processing cue, and earliest after the message of a turn has been produced (at the end of turns) in order to allow for the immediate integration of the turn into surrounding discourse. Finally, what is relevant for interpreting a turn itself follows discourse-organizing tasks. Coming back to the linearization principles discussed above we can now relate the conclusions deriving from the present study to the three main proposals concerning the motivation behind DM clustering that have been made in the literature (see Section 2.2). First, it has become clear that DM sequences are not merely strings of functionally similar DMs with a floor-holding function and “little function in themselves” (e.g. Aijmer, 2002: 31, 2004: 186). Given that DMs serve specific communicative functions, especially at turn-beginnings and -endings where speakers need to deal with a large set of generic tasks, the idea that DM sequences are merely a strategy to stall for time and to hold the floor and thus largely devoid of a particular function is untenable, even though occasional uses of DM sequences as a device to bridge a disfluency are, of course, possible. The data also make clear that co-occurrence in two- or multiple-part sequences does not only affect DMs of the same functional type (although such co-occurrences exist). Thus, sequencing is not merely motivated by one DM specifying or strengthening the meaning of the one with which it co-occurs, as has been suggested before e.g. by Oates (2000) or Fraser (2015). Most of the combinations studied here defy this explanation, given that the highest DTPs are attested with DMs that are functionally dissimilar (consider e.g. the high DTPs for sequences such as yeah I think or yeah but). In view of these observations, it appears more promising to translate the frequencies of co-occurrence and the ordering principles of DMs into empirical predictions on the types of communicative tasks that are often executed together at turnbeginnings and -endings, and their linearetemporal order. As shown above, it appears that these tasks are linearized according to their relation to surrounding discourse: what is relevant for structuring interaction (e.g. turn taking) occurs at the fringe of turns, interpretive cues for understanding the type of turn and its link to prior discourse are produced in close vicinity to the message itself. It is important to note that (i) the expression of the different communicative tasks described above is not required in all contexts and that (ii) DMs are not the only means to deal with these tasks, which explains why the extent to which speakers make use of DMs is always subject to contextual conditions. For instance, where the explicit indication of turn reception or of the link between turns is irrelevant, no use is made of DMs from these categories, e.g. when speakers complete another speaker's turn, as in line 15 in (12). (12) 12 13 14 15 16 17 / A: B: A: B: D: I I met her and <<unclear words>> you PIcked her up and DROPPed her did you . picked her up and dropped her yeah. only to pick her up again¼ ¼ I hope. (.) oh well if you've got her telephone number you picked her up. [ICE-GB S1A-020] It goes without saying that the functional categories for the DMs investigated in the present studies are rather broad. However, a more fine-grained differentiation is fraught with difficulties. The reason is that many DMs are used with subtle functional differences so that the categorical frame must necessarily be relatively gross. For instance, many DMs indicating the type of textual link between two turns do so on different dimensions, e.g. in terms of propositional or discursiveerhetorical relations. The DM but, for example, can be used to introduce a contrast to what has been expressed or implied in the prior turn, and thus to indicate a relationship on the propositional level, but it is also used on the discourseorganizing level to introduce a shift in discourse, e.g. a shift away from what has been discussed thus far (Bell, 1998). This is the case in (13): speaker D seeks to close a topic as he has no further information to contribute, rather than expressing a contrast to the content expressed in the preceding turn(s). (13) 104 105 106: 107 108 B: C: B: D: she's doing a PGCE.4 yes. [pathetic.] [sorry, ] but that's all I know about it really. [ICE-GB S1A-040] 4 PGCE ¼ Post-graduate certificate in education. 16 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 A further discourse-organizing use of but is that of a topic shift, as illustrated in (14), where the speaker does not produce a counterview, but seeks to shift the addressee's focus of attention to a different aspect of a topic under discussion. The speakers talk about reading books, D saying that he can hardly stop reading books once he has started a new one. (14) 237 238 239 240 A: D: A: I share your view but I just wonder why you think that's good. well I s- I suppose it's the writer (.) [that uh] [gets you so involved.] [yeah ] [but what you- ] what are you getting out of it. (.) [ICE-GB S1A-016] Again, the contrast projected by turn-initial but is to be found on the discourse rather than the propositional level as the speaker does not argue against D's assertion, but focuses on a different aspect (the general benefit or gain resulting from binch reading). So given the subtle functional differences of single DMs, which may act on several “planes of talk” (Schiffrin, 1987), the functional categorization suggested in this paper is only a gross simplification. Yet, the rather general categorical system yielded interesting results on the systematic ways in which DM are serialized and can thus serve as the basis for more finegrained analyses. 8. Conclusions and outlook The empirical analysis of DM sequences has shown that there is a strong tendency toward a particular serial order when DMs occur in sequences at the beginning and the end of turns. The order of DMs appears to reflect the temporal order of communicative tasks that are relevant for speakers at a given moment in interactive speech production, and thus offer insights into minute (perhaps serialized) cognitive processes involved in speech processing. The findings thus provide evidence for the DM Sequencing Hypothesis formulated earlier in this article. A promising step in finding further evidence for this hypothesis would be to expand the number of functional categories, based on a more fine-grained categorial system of DM functions, in order to sharpen our understanding of the serialization order of certain communicative functions and subfunctions in real-time speech production (see e.g. Crible, 2018, who annotated DMs in English and French following a fine-grained taxonomy). Furthermore, more data from comparative, crosslinguistic research are needed to explore the possible existence of more general cognitive processes underlying the stepby-step production of turns, independent of individual language systems. Given that conversation is interactive and, as far as we know, organized in adjacent turns in any language (Levinson, 2015), it is not unlikely that the communicative tasks that speakers need to deal with at turn-beginnings and -endings, as well as the serial order in which they are addressed, form some kind of “communicative universal” underlying human spoken interaction. Transcription conventions [] ¼ (.) (2.0) :,:: rea(hh)lly ABsolutely really word . , ? ¿ overlap and simultaneous talk latching micropause measured pause segmental lengthening according to duration laugh particles within talk strong, primary stress via loudness stress via pitch or amplitude produced softer than surrounding talk falling intonation (terminal pitch) continuing intonation rising intonation a rise stronger than mid-level but weaker than high-terminal pitch References Aijmer, Karin, 2002. English Discourse Particles. Evidence from a Corpus. John Benjamins, Amsterdam. Aijmer, Karin, 2004. Pragmatic markers in spoken interlanguage. Nordic Journal of English Studies 3 (1), 173e190. Auer, Peter, 1996. The pre-front field in spoken German and its relevance as a grammaticalization position. Pragmatics 6 (3), 295e322. Barsalou, Lawrence W., 1983. Ad hoc categories. Mem. Cognit. 11 (3), 211e227. Bell, David, 1998. Cancellative discourse markers: a core/periphery approach. Pragmatics 8 (4), 515e542. Biber, Douglas, Johansson, Stig, Leech, Geoffrey, Conrad, Susan, Finegan, Edward, 1999. Longman Grammar of Spoken and Written English. Pearson Education, Harlow. Blakemore, Diane, 2002. Relevance and Linguistic Meaning: the Semantics and Pragmatics of Discourse Markers. Cambridge University Press, Cambridge. Brinton, Laurel J., 2008. The Comment Clause in English: Syntactic Origins and Pragmatic Development. Cambridge University Press, Cambridge. Cheshire, Jenny, 2007. Discourse variation, grammaticalisation and stuff like that. J. SocioLinguistics 11 (2), 155e193. A. Haselow / Journal of Pragmatics 146 (2019) 1e18 17 Crible, Ludivine, 2017. Discourse markers and (dis)fluencies in English and French: variation and combination in the DisFrEn corpus. Int. J. Corpus Linguist. 22 (2), 242e269. Crible, Ludivine, 2018. Discourse Markers and (Dis)fluency. Forms and Functions across Languages and Registers. John Benjamins, Amsterdam. Crible, Ludivine, Cuenca, María-Josep, 2017. Discourse markers in speech: characteristics and challenges for corpus annotation. Dialogue and Discourse 8 (2), 149e166. Cuenca, Maria Josep, Marín, Maria Josep, 2009. Co-occurrence of discourse markers in Catalan and Spanish oral narrative. J. Pragmat. 41, 899e914. Cuenca, María Josep, Crible, Ludivine, 2019. Co-occurrence of discourse markers in English: from juxtaposition to composition. J. Pragmat. 140, 171e184. Degand, Liesbeth, van Bergen, Geertje, 2018. Discourse markers as turn-transition devices: evidence from speech and instant messaging. Discourse Process 55 (1), 47e71. Degand, Liesbeth, Cornillie, Bert, Pietrandrea, Paola (Eds.), 2013. Discourse Markers and Modal Particles. Categorization and Description. John Benjamins, Amsterdam. , Nicole, 2014. Parentheticals in Spoken English: the Syntax-Prosody Relation. Cambridge University Press, Cambridge. Dehe Deppermann, Arnulf, 2013. Turn-design at turn-beginnings: multimodal resources to deal with tasks of turn-construction in German. J. Pragmat. 46 (1), 91e121. Diewald, Gabriele, 2013. ‘Same same but different’ e modal particles, discourse markers and the art (and purpose) of categorization. In: Degand, Liesbeth, Pietrandrea, Paola, Cornillie, Bert (Eds.), Discourse Markers and Modal Particles. Categorization and Description. John Benjamins, Amsterdam & Philadelphia, pp. 19e46. Dik, Simon C., 1997. The Theory of Functional Grammar. Part 2. Mouton de Gruyter, Berlin. Dines, Elizabeth R., 1980. Variation in discourse e ‘and stuff like that’. Lang. Soc. 9 (1), 13e31. tane, 2013. Les associations de marqueurs discursifs. De la cooccurrence libre a la collocation. Linguist. Online 62 (5), 15e45 special issue on Dostie, Gae “Forms and Functions of Formulatic Construction Units in Authentic Conversation”. Fischer, Kerstin, 2000. From Cognitive Semantics to Lexical Pragmatics: the Functional Polysemy of Discourse Particles. Mouton de Gruyter, Berlin. Fischer, Kerstin (Ed.), 2006. Approaches to Discourse Particles. Elsevier, Amsterdam. Fraser, Bruce, 1999. What are discourse markers? J. Pragmat. 31 (7), 931e952. Fraser, Bruce, 2011. The sequencing of contrastive discourse markers in English. Int. Rev. Pragmat. 1 (2), 293e320. Fraser, Bruce, 2015. The combining of discourse markers e a beginning. J. Pragmat. 86, 48e53. lez, M., 2005. Pragmatic markers and discourse coherence relations in English and Catalan oral narrative. Discourse Stud. 77 (1), 53e86. Gonza Gregory, Michelle L., Raymond, William D., Bell, Alan, Fosler-Lussier, Eric, Jurafsky, Daniel, 1999. The effects of collocational strength and contextual predictability in lexical production. CLS 35, 151e166. Hancil, Sylvie, Haselow, Alexander, Post, Margje (Eds.), 2015. Final Particles. De Gruyter Mouton, Berlin. Hansen, Maj-Britt Mosegaard, 2006. A dynamic polysemy approach to the lexical semantics of discourse markers (with an exemplary analysis of French toujours). In: Fischer, Kerstin (Ed.), Approaches to Discourse Particles. Elsevier, Amsterdam, pp. 21e41. Haselow, Alexander, 2012. Subjectivity, intersubjectivity and the negotiation of common ground in spoken discourse: final particles in English. Lang. Commun. 32 (3), 182e204. Haselow, Alexander, 2013. Arguing for a wide conception of grammar: the case of final particles in spoken discourse. Folia Linguist. 47 (2), 375e424. Haselow, Alexander, 2016. A processual view on grammar: macrogrammar and the final field in spoken syntax. Lang. Sci. 54, 77e101. Haselow, Alexander, 2017. Spontaneous Spoken English. An Integrated Approach to the Emergent Grammar of Speech. Cambridge University Press, Cambridge. Heine, Bernd, 2012. On discourse markers: grammaticalization, pragmaticalization, or something else? Linguistics 51 (6), 1205e1247. Heritage, John, 2013. Turn-initial position and some of its occupants. J. Pragmat. 57, 331e337. Holmes, Janet, 1983. The functions of tag questions. English Language Research Journal 3, 40e65. Jucker, Andreas H., Smith, Sara W., 1998. And people just you know like ‘wow’ e discourse markers as negotiating strategies. In: Jucker, Andreas H., Ziv, Yael (Eds.), Discourse Markers. Theory and Descriptions. John Benjamins, Amsterdam, pp. 171e201. Kapatsinski, Vsevolod M., 2005. Measuring the relationship of structure to use: determinants of the extent of recycle in repetition repair. Berkeley Linguistics Society 30, 481e492. Koops, Christian, Lohmann, Arne, 2015. A quantitative approach to the grammaticalization of discourse markers. Evidence from their sequencing behavior. Int. J. Corpus Linguist. 20 (2), 232e259. Lee-Goldman, Russel, 2011. No as a discourse marker. J. Pragmat. 43 (10), 2627e2649. Lenk, Uta, 1998. Marking Discourse Coherence: Functions of Discourse Markers in Spoken English. Narr, Tübingen. Levinson, Stephen, 2015. Turn-taking in human communication e origins and implications for language processing. Trends Cognit. Sci. 20 (1), 6e14. Levinson, Stephen C., Torreira, Francisco, 2015. Timing in turn-taking and its implications for processing models of language. In: Holler, Judith, Kendrick, Kobin, Casillas, Marisa, Levinson, Stephen (Eds.), Turn-Taking in Human Communicative Interaction. Frontiers Media, Lausanne, pp. 10e26. €ck, Lohmann, Arne, Koops, Christian, 2016. Aspects of discourse marker sequencing: empirical challenges and theoretical implications. In: Gunther, Kaltenbo Keizer, Evelien, Lohmann, Arne (Eds.), Outside the Clause. Form and Function of Extra-clausal Constituents. John Benjamins, Amsterdam, pp. 417e445. Maschler, Yael, 1994. Metalanguaging and discourse markers in bilingual conversation. Lang. Soc. 23, 325e366. Maschler, Yael, 2009. Metalanguage in Interaction: Hebrew Discourse Markers. John Benjamins, Amsterdam. , Andrea, 2018. Linguistic strategies for ad hoc categorization: theoretical assessment and cross-linguistic variation. Folia Linguist. 52, Mauri, Caterina, Sanso 1e35. McCarthy, Michael, 2003. Talking back: ‘Small’ interactional response tokens in everyday conversation. Res. Lang. Soc. Interact. 36 (1), 33e63. Nuyts, Jan, 2001. Epistemic Modality, Language and Conceptualization: A Cognitive-Pragmatic Perspective. John Benjamins, Amsterdam. Oates, Sarah Louise, 2000. Multiple discourse marker occurrence: creating hierarchies for natural language. In: Proceedings of the 3rd CLUK Colloquium, Brighton, pp. 41e45. Overstreet, Maryann, 1999. Whales, Candlelight, and Stuff like that. General Extenders in English Discourse. Oxford University Press, New York. Overstreet, Maryann, 2014. The Role of Pragmatic Function in the Grammaticalization of English General Extenders. Pragmatics vol. 24 (1), 105e129. Pichler, Heike, Levey, Stephen, 2011. In search of grammaticalization in synchronic dialect data: general extenders in Northeast England. Engl. Lang. Linguist. 15 (3), 441e471. Pons Bordería, Salvador, 2018. The combination of discourse markers in spontaneous conversations: keys to undo a gordian knot. Rev. Romane 53 (1), 121e158. Redeker, Gisela, 2006. Discourse markers as attentional cues at discourse transitions. In: Fischer, Kerstin (Ed.), Approaches to Discourse Particles. Elsevier, Amsterdam, pp. 339e358. Rühlemann, Christoph, 2017. Integrating corpus-linguistic and conversation-analytic transcriptions in XML: the case of backchannels and overlap in storytelling interaction. Corpus Pragmatics 1, 201e232. Schegloff, Emanuel A., 1996. Turn organization: one intersection of grammar and interaction. In: Ochs, Elinor, Schegloff, Emanuel A., Thompson, Sandra A. (Eds.), Interaction and Grammar. Cambridge University Press, Cambridge, pp. 52e133. Schegloff, Emanuel A., 2007. Sequence Organization in Interaction: A Primer in Conversation Analysis. Cambridge University Press, Cambridge. Schiffrin, Deborah, 1987. Discourse Markers. Cambridge University Press, Cambridge. Schourup, Lawrence, 1985. Common Discourse Particles in English Conversation. Garland, New York. Schourup, Lawrence, 1999. Discourse markers. Lingua 107, 227e265. Schourup, Lawrence, 2001. Rethinking well. J. Pragmat. 33, 1025e1060. 18 A. Haselow / Journal of Pragmatics 146 (2019) 1e18 Stalnaker, Robert, 2002. Common ground. Ling. Philos. 25 (5e6), 701e721. Thompson, Sandra A., Mulac, Anthony J., 1991. The discourse conditions for the use of the complementizer ‘that’ in conversational English. J. Pragmat. 15, 237e251. Tottie, Gunnel, Hoffmann, Sebastian, 2006. Tag questions in British and American English. J. Engl. Linguist. 34 (4), 283e311. Tottie, Gunnel, Hoffmann, Sebastian, 2009. Tag questions in English e the first century. J. Engl. Linguist. 37, 130e161. Traugott, Elizabeth C., 2016. On the rise of types of clause-final pragmatic markers in English. J. Hist. Pragmat. 17 (1), 26e54. Alexander Haselow is Assistant Professor of English Linguistics at the University of Münster. His current research focuses on the cognitive, dialogic and neural mechanisms underlying the production and perception of speech in real time, and how these can be integrated into a usage-based model of speech production. He is the author of Spontaneous Spoken English (Cambridge, 2017), co-editor of Final Particles (De Gruyter Mouton, 2015), and he has published various articles on structural and cognitive aspects of spontaneous speech in international, peer-reviewed journals.