Information Structure and Thematic Structure of Persian Language: A Corpus-based Analysis

Document Type : Research

Authors

1 PhD Candidate of General Linguistics, Department of Linguistics, Ilam Branch, Islamic Azad University, Ilam, Iran

2 Assistant Professor, Department of Linguistics, Ilam Branch, Islamic Azad University, Ilam, Iran

Abstract

Other things being equal, the grammatical subject is coded as the theme of the clause. In marked cases, however, non-subject arguments and prepositional phrases may appear as the theme as a result of processes such as preposing. The present study has three goals. First, it investigates the information status of the grammatical subject as the unmarked theme of the clause. Second, it identifies the marked themes in non-canonical constructions, including inversion, topicalization, passivization, and scrambling; preposing is used here as a cover term for all these processes. Accordingly, the study examines the information status, syntactic category, and grammatical function of preposed constituents (marked themes). Third, it investigates the degree of matching between grammatical subject and theme. This is a corpus-based study in which the data were collected from two different corpora: a movie (Separation by Farhadi) and a football report. The theory of information structure proposed by Prince (1992) and revised in Birner (2013) is adopted as the theoretical model of the study. In this theory, information is classified with respect to both the discourse and the hearer. Accordingly, the information status of any constituent falls into one of four cases: discourse-old/hearer-old, discourse-old/hearer-new, discourse-new/hearer-old, and discourse-new/hearer-new. This classification is applied to the collected themes (both marked and unmarked) of both corpora. Halliday (2004) is the other theoretical basis of this study: the different types of themes are taken from Halliday (2004) and analyzed in terms of information status. The focus is on topical themes (not interpersonal or textual themes). The target themes were then coded according to the theoretical model and analyzed with SPSS (version 23).
In terms of syntactic markedness, the results indicated that the football report contained more marked syntactic structures than the film corpus. As for the match between subject and theme, the results indicated a higher degree of match between the two in the film corpus than in the football report corpus. The lower match in the football report can be accounted for by the higher number of movements and dislocations in the syntax of this type of language. As stated, the main goal of this study was to investigate the information status of both marked and unmarked themes. As for unmarked themes (subjects), the results showed that in the film corpus such themes are mainly old (more than 90%) in terms of both discourse and hearer. In contrast, in the football report corpus the unmarked themes were found to be mainly discourse-old but hearer-new (more than 65%). This difference is due to the different nature of the two corpora: in daily conversation (film) there is topic continuity, whereas in the football report successive sentences refer to different individuals (no topic continuity). Preposed constituents (marked themes) were also investigated. Their information status shows almost the same pattern as that of unmarked themes: most of the marked themes in the film corpus are old in terms of both discourse and hearer, whereas in the football report corpus the marked (preposed) themes are mainly discourse-old and hearer-new. This is somewhat problematic, since discourse-old entities cannot be hearer-new. Prince (1992) resolved this problem by appealing to schemata and set relations. She argued that many referents are implied by the discourse and hence count as discourse-old; she called them inferables. In the football report corpus, there were many such inferable marked themes.
Another unexpected finding was the high number of new preposed themes in the film corpus, which seems to contradict the previous literature (e.g., Birner, 1996). A careful examination of the literature, however, shows that it does not rule out new preposed items in theme position, although themes naturally have identifiable referents. The ‘recency of newness/oldness’ proposed by Birner (1996) can account for the pattern that emerged in the present study. The results of this study are important both theoretically and practically. Many findings showed that the constraints on preposing are not observed in the two investigated corpora. This implies that Persian, owing to its flexible word order, does not follow the constraints reported for other languages. There seems to be a connection between language type and the kind of constraints that hold in a language. Further studies could therefore seek the appropriate constraints governing the information status of themes in Persian. Another implication is that, within the same language, the information status of themes does not follow a single pattern: the information status of themes in the film corpus was dramatically different from that in the football report corpus. Therefore, the results obtained from one study cannot be generalized to the whole language.
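
The four-way coding scheme described in the abstract (discourse-old or discourse-new, crossed with hearer-old or hearer-new) can be illustrated with a minimal Python sketch. This is not the authors' procedure, which used SPSS; the `Theme` record, the `information_status` and `status_shares` helpers, the corpus labels, and the toy data are all hypothetical and serve only to show how coded themes could be tabulated per corpus.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Theme:
    """One coded topical theme (hypothetical record layout)."""
    corpus: str          # illustrative labels, e.g. "film" or "football"
    marked: bool         # True for a preposed (marked) theme
    discourse_old: bool  # evoked or inferable in the prior discourse
    hearer_old: bool     # assumed to be already known to the hearer


def information_status(theme: Theme) -> str:
    """Map the two binary codes onto one of the four status labels."""
    d = "discourse-old" if theme.discourse_old else "discourse-new"
    h = "hearer-old" if theme.hearer_old else "hearer-new"
    return f"{d}/{h}"


def status_shares(themes, corpus: str, marked: bool) -> dict:
    """Relative frequency of each status among the selected themes."""
    selected = [t for t in themes if t.corpus == corpus and t.marked == marked]
    if not selected:
        return {}
    counts = Counter(information_status(t) for t in selected)
    return {status: n / len(selected) for status, n in counts.items()}


# Toy data invented for illustration only; these are NOT the study's results.
toy_themes = [
    Theme("film", False, True, True),
    Theme("film", False, True, True),
    Theme("film", True, False, False),
    Theme("football", False, True, False),
]

print(status_shares(toy_themes, "film", marked=False))
```

Keeping the two dimensions as separate boolean codes, and deriving the combined label only when tabulating, makes it straightforward to compare marked and unmarked themes within and across corpora.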

References

Ahmadi, S., & Gowhari, H. (2019). Clitic doubling and dislocation of direct object based on evidence from Kalhori Kurdish. Language Science, 7(11), 351-386 [In Persian].
Baker, P., Gabrielatos, C., & McEnery, T. (2013). Sketching Muslims: A corpus driven analysis of representations around the word ‘Muslim’ in the British press 1998–2009. Applied Linguistics, 34(3), 255–278.
Birner, B. J. (1996). The discourse function of inversion in English. New York: Garland.
Birner, B. J. (2013). Discourse functions at the periphery: Noncanonical word order in English. ZAS Papers in Linguistics, 35(1), 41-62. doi:10.21248/zaspil.35.2004.221.
Birner, B. J., & Mahootian, Sh. (1997). Functional constraints on inversion in English and Farsi. Language Sciences, 18(1-2), 127-138.
Brezina, V. (2018). Statistics in corpus linguistics: A practical guide. Cambridge: Cambridge University Press.
Gowhary, H. (2011). Information structure in English and Persian and its translation (PhD dissertation). Allameh Tabataba’i University, Tehran, Iran [In Persian].
Halliday, M. A. K., & Matthiessen, C. (2004). An introduction to functional grammar. London: E. Arnold.
Lambrecht, K. (1994). Information structure and sentence form. Cambridge: Cambridge University Press.
Modaresi, B. (2007). Representation of information structure in Persian (PhD dissertation). Allameh Tabataba’i University, Tehran, Iran [In Persian].
Moezzipour, F. (2019). A functional reanalysis of object clitic doubling in Persian. Journal of Linguistics & Khorasan Dialects, 11(1), 227-259 [In Persian].
Prince, E. F. (1981). Toward a taxonomy of given/new information. In P. Cole (Ed.), Radical pragmatics (pp. 223-254). New York: Academic Press.
Prince, E. F. (1992). The ZPG letter: Subjects, definiteness, and information-status. In S. Thompson & W. Mann (Eds.), Discourse description: Diverse analyses of a fundraising text (pp. 295-325). Amsterdam: John Benjamins.
Rasekh Mahand, M. (2006). The relationship between syntactic scrambling and focus in Persian. Journal of Dastur, 2, 20-32 [In Persian].
Rasekh Mahand, M., & Ghiasvand, M. (2014). A corpus-based analysis of functional factors affecting short syntactic scrambling in Persian. Journal of Dastur, 2, 163-198 [In Persian].
Shahidi, N. (2000). Topicalization in Persian: A functional approach (Master's Thesis). Allameh Tabataba’i University, Tehran, Iran [In Persian].
Ward, G., & Birner, B. J. (2004). Information structure and non-canonical syntax. In L. R. Horn & G. Ward (Eds.), The handbook of pragmatics (pp. 1934-1963). Oxford, United Kingdom: Blackwell Publishing.