Difference between revisions of "References"

m
 
(One intermediate revision by one other user not shown)
Line 1: Line 1:
 
'''1. Problem and history'''
 
'''1. Problem and history'''
  
“Reference” is any sign joining a sentence with any preceding sentence. References are the means for making the text a compact entity. In qualitative linguistics there is an ample literature on kinds and behavior of references. They are the basis for evaluating text cohesion. The sentences joined by a commoh reference are called ''hrebs''.
+
“Reference” is any sign joining a sentence with any preceding sentence. References are the means for making the text a compact entity. They are the basis for evaluating text cohesion. In qualitative linguistics there is an ample literature on different kinds of references and their behavior. The sentences joined by a commoh reference are called ''hrebs''.
The only law, known in literature as “Hřebíček´s reference law”, originates from Hřebíček´s (1985) derivation. Altmann (1988: 81-85) proposed merely some further problems for investigation. The law was corroborated on many Turkish texts. Formula (4) supports Herdan´s version of the type-token ratio (<math>\rightarrow</math>).  
+
 
 +
The only law, known in the literature as “Hřebíček´s reference law”, originates from Hřebíček´s (1985) derivation. Altmann (1988: 81-85) proposed merely some further problems for investigation. The law was corroborated on many Turkish texts. Formula (4) supports Herdan´s version of the type-token ratio (<math>\rightarrow</math>).  
  
 
'''2. Hypothesis'''  
 
'''2. Hypothesis'''  
Line 15: Line 16:
 
Let
 
Let
  
r = number of references in text
+
<math>r</math> = number of references in text
  
s = number of sentences in text
+
<math>s</math> = number of sentences in text
  
n = number of word tokens in text
+
<math>n</math> = number of word tokens in text
  
v = number of word types in text (vocabulary of the text)
+
<math>v</math> = number of word types in text (vocabulary of the text)
  
w = vocabulary richness
+
<math>w</math> = vocabulary richness
  
a, b, c, A, B  = constants
+
<math>a, b, c, A, B</math>   = constants
  
 
Hřebíček´s assumptions:  
 
Hřebíček´s assumptions:  
  
(i) the richer the vocabulary, the smaller is the number of references
+
(i) the richer the vocabulary, the smaller the number of references,
  
(ii) the more sentences are in the text, the greater the number of references.
+
(ii) the more sentences in the text, the greater the number of references.
  
The change of the number of references relative to the change of the vocabulary richness is proportional to the number of sentences   
+
The change in the number of references relative to the change in the vocabulary richness is proportional to the number of sentences,  
  
<math>\frac{\partial r}{\partial w}=As</math>  
+
<math>\frac{\partial r}{\partial w}=As</math>,  
 
 
and, at the same time, the change of the number of references relative to the change of the number of sentences is proportional to the vocabulary richness of the text
+
and, at the same time, the change in the number of references relative to the change in the number of sentences is proportional to the vocabulary richness of the text,
  
<math>\frac{\partial r}{\partial s}=Aw</math>
+
<math>\frac{\partial r}{\partial s}=Aw</math>.
  
yielding the solution
+
This yields the following solution<:
  
(1)<math>r=csw\quad</math> (c = AB).
+
(1) <math>r=csw\quad</math> (<math>c = AB</math>).
  
Taking the simplest interpretation of vocabulary richness w as
+
Taking the simplest interpretation of vocabulary richness <math>w</math> as
  
<math>w=\frac{v}{n}</math>  
+
<math>w=\frac{v}{n}</math>,  
  
 
we obtain
 
we obtain
  
(2)<math>r=cs\frac{v}{n}</math> .
+
(2) <math>r=cs\frac{v}{n}</math> .
  
Using Herdan´s (1966: 76) type-token ratio (<math>\rightarrow</math>) expressing the vocabulary of the text as a power function of its length
+
Using Herdan´s (1966: 76) type-token ratio (<math>\rightarrow</math>) to express the vocabulary of the text as a power function of its length,
  
(3)<math> v=n^a,\quad 0 < a < 1 </math>
+
(3) <math> v=n^a,\quad 0 < a < 1</math>,
  
and inserting it in (2) one obtains
+
and inserting this in (2), one obtains
  
(4)<math> r= csn^{a-1}=csn^b, \quad  a-1 = b \quad    -1 < b < 0</math>
+
(4) <math> r= csn^{a-1}=csn^b, \quad  a-1 = b \quad    -1 < b < 0</math>,
  
meeting assumptions (i) and (ii).
+
which meets assumptions (i) and (ii).
 
 
 
'''Example''': The course of references in a Turkish text
 
'''Example''': The course of references in a Turkish text
Line 70: Line 71:
  
  
'''4. Authors''': G. Altmann
+
'''4. Authors''': U. Strauss, G. Altmann
  
 
'''5. References'''
 
'''5. References'''
Line 87: Line 88:
 
Hřebíček 1985, 1986, 1989, 1992, 2000; Altmann 1988.
 
Hřebíček 1985, 1986, 1989, 1992, 2000; Altmann 1988.
  
 +
'''Hřebíček, L'''. (2006). Text laws. In: Köhler, R., Altmann, G., Piotrowski, R.G. (eds.), ''Quantitative Linguistics. An International Handbook: 348-361.'' Berlin: de Gruyter.
 +
 +
'''Mehler, A. ''' (2006). Eigenschaften der textuellen Einheiten und Systeme. In: Köhler, R., Altmann, G., Piotrowski, R.G. (eds.), ''Quantitative Linguistics. An International Handbook: 325-348.'' Berlin: de Gruyter.
  
 
[[[Das zeichnen geht schwer, da zwei unabhängige Variablen drin sind. Mit Harvard Graphics wirds]]]
 
[[[Das zeichnen geht schwer, da zwei unabhängige Variablen drin sind. Mit Harvard Graphics wirds]]]
  
 
[[Category:Unfertig]]
 
[[Category:Unfertig]]

Latest revision as of 11:01, 16 June 2009

1. Problem and history

“Reference” is any sign joining a sentence with any preceding sentence. References are the means for making the text a compact entity. They are the basis for evaluating text cohesion. In qualitative linguistics there is an ample literature on different kinds of references and their behavior. The sentences joined by a commoh reference are called hrebs.

The only law, known in the literature as “Hřebíček´s reference law”, originates from Hřebíček´s (1985) derivation. Altmann (1988: 81-85) proposed merely some further problems for investigation. The law was corroborated on many Turkish texts. Formula (4) supports Herdan´s version of the type-token ratio (\rightarrow).

2. Hypothesis

The number of references in text depends on the number of words and the number of sentences.

“Word” is every word-like entity (token) in the text. “Sentence” in written texts is demarcated by orthographic signs.

3. Derivation

Let

r = number of references in text

s = number of sentences in text

n = number of word tokens in text

v = number of word types in text (vocabulary of the text)

w = vocabulary richness

a, b, c, A, B = constants

Hřebíček´s assumptions:

(i) the richer the vocabulary, the smaller the number of references,

(ii) the more sentences in the text, the greater the number of references.

The change in the number of references relative to the change in the vocabulary richness is proportional to the number of sentences,

\frac{\partial r}{\partial w}=As,

and, at the same time, the change in the number of references relative to the change in the number of sentences is proportional to the vocabulary richness of the text,

\frac{\partial r}{\partial s}=Aw.

This yields the following solution<:

(1) r=csw\quad (c = AB).

Taking the simplest interpretation of vocabulary richness w as

w=\frac{v}{n},

we obtain

(2) r=cs\frac{v}{n} .

Using Herdan´s (1966: 76) type-token ratio (\rightarrow) to express the vocabulary of the text as a power function of its length,

(3)  v=n^a,\quad 0 < a < 1,

and inserting this in (2), one obtains

(4)  r= csn^{a-1}=csn^b, \quad  a-1 = b \quad     -1 < b < 0,

which meets assumptions (i) and (ii).

Example: The course of references in a Turkish text

Hřebíček (1992) examined the course of references in several Turkish texts. One of these cases is shown in Table 1.

Tabelle1 R.jpg


4. Authors: U. Strauss, G. Altmann

5. References

Altmann, G. (1988a). Wiederholungen in Texten. Bochum, Brockmeyer.

Hřebíček, L. (1985). Text as a unit and co-references. In: Ballmer, Th.T. (ed.), Linguistic dynamics: 190-198. New York, de Gruyter.

Hřebíček, L. (1986). Cohesion in Ottoman poetic texts. Archiv orientální 54,252-256.

Hřebíček, L. (1989). A syntactic variable on the text level. Glottometrika 10, 204-218.

Hřebíček, L. (1992). Text in communication: Supra-sentence structure. Bochum, Brockmeyer.

Hřebíček, L. (2000). Variation in sequences. Prague: Oriental Institute. Hřebíček 1985, 1986, 1989, 1992, 2000; Altmann 1988.

Hřebíček, L. (2006). Text laws. In: Köhler, R., Altmann, G., Piotrowski, R.G. (eds.), Quantitative Linguistics. An International Handbook: 348-361. Berlin: de Gruyter.

Mehler, A. (2006). Eigenschaften der textuellen Einheiten und Systeme. In: Köhler, R., Altmann, G., Piotrowski, R.G. (eds.), Quantitative Linguistics. An International Handbook: 325-348. Berlin: de Gruyter.

[[[Das zeichnen geht schwer, da zwei unabhängige Variablen drin sind. Mit Harvard Graphics wirds]]]