What makes them different?
text A:
I have just returned from a visit to my landlord - the solitary neighbour that I shall be troubled with. This is certainly a beautiful country! In all England, I do not believe that I could have fixed on a situation so completely removed from the stir of society. A perfect misanthropist's heaven: and Mr. Heathcliff and I are such a suitable pair to divide the desolation between us. A capital fellow!
text B:
OF MANS First Disobedience, and the Fruit
Of that Forbidden Tree, whose mortal tast
Brought Death into the World, and all our woe,
With loss of Eden, till one greater Man
Restore us, and regain the blissful Seat,
Sing Heav'nly Muse, that on the secret top
Of Oreb, or of Sinai, didst inspire
That Shepherd, who first taught the chosen Seed,
In the Beginning how the Heav'ns and Earth
Rose out of Chaos: or if Sion Hill
Delight thee more, and Siloa's Brook that flow'd
Fast by the Oracle of God; I thence
Invoke thy aid to my adventrous Song,
Burrows's Delta
the mean of the absolute differences between the z-scores for a set of word-variables in a given text-group and the z-scores for the same set of word-variables in a target text.
The simple idea behind Delta
\[\delta = \frac{\left\vert a_1 - b_1 \right\vert + \left\vert a_2 - b_2 \right\vert + \dots + \left\vert a_n - b_n \right\vert}{n} \]
or conveniently:
\[\delta = \frac{1}{n} \sum_{i=1}^n{\left\vert a_i - b_i \right\vert} \]
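The formula above can be sketched in a few lines of Python. This is only an illustration: the function name `mean_abs_diff` and the z-score values are made up for the example and do not come from any real text.

```python
def mean_abs_diff(a, b):
    """Mean of the absolute differences between two equal-length score lists."""
    if len(a) != len(b):
        raise ValueError("both texts need the same number of word-variables")
    if not a:
        raise ValueError("at least one word-variable is required")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


# Hypothetical z-scores for three word-variables in two texts.
text_a = [0.5, -1.0, 1.5]
text_b = [1.0, 0.0, -0.5]

delta = mean_abs_diff(text_a, text_b)
print(f"delta = {delta:.3f}")
```

A smaller value means the two texts use the chosen words more similarly; comparing a text with itself gives 0.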
pairwise similarities between texts
| "the" | 4.57 | 4.24 | 4.25 | 4.19 | 4.47 |
| "to" | 3.11 | 3.29 | 3.43 | 3.14 | 3.71 |
| "and" | 3.19 | 3.0 | 3.08 | 2.85 | 2.81 |
| "of" | 2.6 | 3.0 | 2.63 | 2.43 | 2.86 |
| "I" | 2.17 | 2.2 | 2.13 | 2.42 | 2.22 |
| "a" | 2.24 | 1.92 | 1.92 | 2.21 | 1.92 |
| . . . | . . . | . . . | . . . | . . . | . . . |
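As a rough sketch of how pairwise distances could be derived from such a table, the snippet below treats each of the five value columns as one text and each row as one word. That reading of the table is an assumption, as is the choice to z-score every word across the five texts using the sample standard deviation. The names `freqs`, `z_scores`, and `delta` are illustrative only.

```python
from statistics import mean, stdev

# Values for the six listed words; each list holds one value per text
# (assumption: the five columns of the table are five different texts).
freqs = {
    "the": [4.57, 4.24, 4.25, 4.19, 4.47],
    "to": [3.11, 3.29, 3.43, 3.14, 3.71],
    "and": [3.19, 3.0, 3.08, 2.85, 2.81],
    "of": [2.6, 3.0, 2.63, 2.43, 2.86],
    "I": [2.17, 2.2, 2.13, 2.42, 2.22],
    "a": [2.24, 1.92, 1.92, 2.21, 1.92],
}


def z_scores(values):
    """Standardise one word's values across all texts."""
    m = mean(values)
    s = stdev(values)  # sample standard deviation (a design choice)
    return [(v - m) / s for v in values]


def delta(a, b):
    """Mean absolute difference between two z-score vectors."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


# One list of z-scores per word, then transposed to one vector per text.
z_by_word = [z_scores(values) for values in freqs.values()]
n_texts = len(next(iter(freqs.values())))
texts = [[row[j] for row in z_by_word] for j in range(n_texts)]

# Symmetric matrix of pairwise distances between the texts.
matrix = [[delta(texts[i], texts[j]) for j in range(n_texts)]
          for i in range(n_texts)]

for row in matrix:
    print("  ".join(f"{d:.2f}" for d in row))
```

Each text ends up as a vector with one coordinate per word, so the matrix holds distances between points in a six-dimensional space. With only six words and five texts, the numbers serve as an illustration rather than a meaningful result.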
Multivariate aka multidimensional
The features ("the", "to", "and", …) are sometimes called variables.
Consequently, the methods in question are multivariate.
More intriguing is the name multidimensional.
Should I be afraid of multidimensionality? (Well, you were there already, and you didn't even blink!)