Rhythmic units

Revision as of 13:51, 20 May 2006 by Ahans (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

1. Problem and history

A rhythmic unit is according to Marbe (1904) the number of non-stressed syllables between two stressed ones. Some researchers consider it as a whole consisting of a stressed and the following non-stressed syllables. The problem is to ascertain whether the length of rhythmic units abides by a special distribution. The first numerical examinations have been performed by Marbe and Roetteke (1904). Best (2001c) assigns this problem to the “length” problems using the appropriate way of modelling. Brainerd – in another context – considered it a Markov chain. Lehfeldt and Altmann (2003) derive the model from an urn approach considering the Poissonian gap filling between accentuated syllables as a pure birth process with repulsion.

2. Hypothesis

The distribution of rhythmic units follows a regular probability distribution.

3. Derivation

3.1. Best´s approach

Best starts from the usual “length approach” considering the proportionality between frequency classes, i.e.

(1)P_x = g(x)P_{x-1}\quad .

Setting g(x) = a/(b+x) and the necessary displacement which is conventional he obtains

(2)P_{x+1} = \frac{a}{b+x}p_x, \quad x = 1, 2, 3, ...

whose solution yields the 1-displaced hyper-Poisson distribution

(3) P_X = \frac{a^{x-1}}{b^{x-1}_1 F_1(1; b; a)}, \quad x = 1, 2, 3,...

where b^(x) = b(b+1)(b+2)...(b+x-1) and _1 F_1(1; b; a)

is the confluent hypergeometric function. The hyper-Poisson is a special case of the unified theory (\rightarrow) when a_0 = -1, a_1 = a, b_1 = b, a_2 = 0.

Example: Rhythmic units in German

Best (2001c) examined 8 texts processed by Marbe and Roetteke and obtained in 6 cases a corroboration of the model. One of the results can be seen in Table 1 and Fig.1


Tabelle1 RU.jpg
Grafik1 RU.jpg
Fig. 1. Fitting the hyper-Poisson to the data in Table 1


3.2. The birth process with repulsion [[[wo erschien dies???]

Let the gaps between accentuated syllables are considered as urns and the non-accentuated as balls. In time interval h (time is merely an auxiliary variable) either 1 or none ball is inserted in an urn or before the first and behind the last one. Let the assumptions of the Poisson process pure birth process are fulfilled. The urns are not passive but exert influence on the acception of balls, namely the more balls are in the urn, the more the urn repulses new balls: a balanced rhythm requires a restricted number of non-accentuated syllables. Let the birth rate be \lambda_x = n-x. Then one obtains

(4)P'_0(t) = -nP_0(t)\quad

P'_x(t) = (n-x+1)P_{x-1}(t)-(n-x)P_x(t), \quad x = 1, 2, ..., n.

Solving (4) with boundary conditions  P_0(0)=1, P_x(0) = 0, x=1, 2, ..., n and substituting at last e^{-t} = q, p = 1-q we obtain the binomial distribution

(5) P_x ={n \choose x}p^x q^{n-x}, \quad x=0,1,2,...,n

Brainerd (1976) considered sequences of this kind as Markov chains and obtained for the distances chains of different order represented by the modified geometric distribution (see Gap formation). Since in all models of order higher than zero the first class is modified, Lehfeldt and Altmann (2003) modified it a posteriori, too, and obtained the extended positive binomial distribution

(6) P_X =\begin{cases} 1-\alpha & x = 0 \\ \frac{\alpha {n\choose x}p^x q^{n-x}}{1-q^n}& x=1, 2, ..., n  \end{cases}

Evidently, (5) and (6) are identical when α = 1-qn. Example: Intervalls of non-accentuated syllables in Russian

Lehfeldt and Altmann (2003) examined the intervals of non-accentuated syllables in Russian and obtained the results in Table 2 and Fig. 2. Here the rhythmic unit is merely the number of non-accentuated syllables.

Tabelle2 RU.jpg

The result of fitting is satisfactory. Additional pooling had brought still better results.

Grafik2 RU.jpg
Fig. 2. Fitting the binomial d. to the data in Table 2


Example: Rhythmic units in Puschkins Werk “Vystrel”

Lehfeldt and Altmann (2003) fitted the extended positive binomial distribution to the rhythmic units in Puschkins work “Vystrel” and obtained the results presented in Table 3 and Fig. 3.

Tabelle3 RU.jpg


[[[Auch EPPO, Hype, Hpas, EpBin, Bin]] Wird eingetragen! Please complete

4. Authors: G. Altmann

5. References

Best, K.-H. (2001a). Zur Verteilung rhythmischer Einheiten in deutscher Prosa. In: Best, K.H. (ed.), Häufigkeitsverteilungen in Texten: 162-166. Göttingen: Peust & Gutschmidt.

Best, K.-H. (2001b). Probability distributions of language entities. J. of Quantitative Linguistics 8, 1-11.

Best, K-H. (2002). The distribution of rhythmic units in German short prose. Glottometrics 3, 136-142.

Best, K.-H. (2003). Längen rhythmischer Einheiten. In: Altmann, G., Köhler, R., Piotrowski, R. (Hg.), Quantitative Linguistik - Quantitative Linguistics. Ein internationales Handbuch. Berlin/ N.Y.: de Gruyter (in print)

Brainerd, B. (1976). On the Markov nature of text. Linguistics 176, 5-30. Kaßel, A. (2002). Zur Verteilung rhythmischer Einheiten in deutschen und englischen Texten. Staatsexamensarbeit; Göttingen.

Lehfeldt, Altmann 2003; (Bitte ergänzen, wenn es so was gibt)

Marbe, K. (1904). Über den Rhythmus der Prosa. Giessen: J.Ricker´sche Verlagsbuchhandlung. 6