Article of Witztum

This is a copy of an article that Doron Witztum published as a PDF file. The Hebrew has been transliterated, using the Michigan-Clairmont encoding scheme. No claims of accuracy are made.
A reply to this article is available.
Does Tolstoy Really Love Brendan McKay?
Doron Witztum, 1997.
 
On July 31st, 1997, Dr. Brendan McKay of the Australian National
University presented an article on the internet, in which he claims
that the personal details of his life "amazingly" appear encoded at
minimum ELSs along with his name, in Tolstoy's War and Peace. He
claims the probability of this occurring by chance to be less than
one in five thousand.
 
In this response, we will show that Dr. McKay achieved his results
through a combination of systematic error and very partial reporting
of the number of trials conducted.
 
The two errors were employed by Dr. McKay with the aim of creating
a "counterfeit" effect of success. We will demonstrate that real
probability involved in this experiment corresponds exactly with
what is expected to take place randomly.
 
A. THE TYPE OF SAMPLE
 
In a previous version of his sample (in a letter sent to the Internet
TCODE group on May 5, 1997), Dr. McKay describes his experiment as
follows: 
 
  Forwarded message:
  From: 0 bdm@cs.anu.edu.au (Brendan McKay)
  Sender: 0 owner-tcode@express.ior.com
  Reply-to: 0 tcode@express.ior.com
  To: 0 tcode@express.ior.com
  Date: 97-05-05 06:02:02 EDT
 
  Just today I calculated my own birth date in the Jewish calendar.
  It was 26th Tishri 5712. Then I tested it against my name BRNDN,
  using the usual date forms and the year forms chosen for our new
  experiment. (My surname is shorter than 5 letters.)
 
  The results are amazing. The average distance over the three date
  forms is 0.176, compared to 0.328 and 0.330 for the two lists in
  the StatSci paper. The average distance over the five year forms
  in the 5-8 length range is 0.261, though it would be only 0.138
  except for a single large value. The geometric means (which are
  a crude measure of how much contribution is made to the
  P2 statistics) are 0.168 and 0.149, compared to 0.210 and 0.215
  for the two StatSci lists. I would be quite a champion in the
  StatSci lists.
 
  Brendan
  PS. Perhaps I should mention that the text in which I measured
  my name was War and Peace.
 
In other words, he searched for "meetings" of the ELSs of the
name BRNDN (Brendan) with those of his birth date (day and month)
in three forms, and with the year of his birth written in six forms.
 
This type of sample, in which we measure the proximities between
ELSs of a single expression and those of a list of other expressions,
we termed a "heading sample". In our paper "Hidden Code in the Book
of Genesis", ("CPN XMWY BSPR BR)$YT" preprint 1996; originally
presented as a lecture before the Israeli National Academy of
Sciences, 19 March, 1996), we analyzed this type of sample, and we
explained how to carry out the randomization process for it.
 
In his final version, Dr. McKay selected another heading, DR MQYY 
(Dr. McKay) and created another heading sample with the same dates;
the final product being the union (merger) of these two samples.
(He did one other experiment, searching for "Canberra". This will
be discussed in section F).
 
B. THE METHOD OF MEASUREMENT
 
As we explained in the above-mentioned paper, the proper method for
analyzing a heading sample has three stages:
 
1. We use the function c, ("the Corrected Distance" in the
 Statistical Science paper), to measure proximity between the heading
 and each expression w (from the list). However, the function has
 the following alteration: the heading is only taken as ELSs, while
 the expression w is taken as ELSs and as PLSs (perturbed letter
 sequences). In other words, ELSs of w compete with the PLSs of w
 over the more successful proximities to the ELSs of the heading.
 
 If there are n expressions in the list, the values of P1 and P2
 (the Overall Proximity Measures) are calculated based on the n
 results.
 
2. Given a list with n expressions, a series of n permutations is
 randomly drawn: permutation i changes the order of the letters in
 expression i. In this way, n new expressions are derived and their
 proximities with the heading are calculated (as explained above in
 stage 1). The values of P'1 and P'2 are calculated for this series
 of n permutations.
 
3. This process is carried out N times, and the rank order of P1
 and P2 is calculated.
 
C. THE RESULTS FOR MCKAY'S SAMPLE:
 
When Dr. McKay's double "heading" sample is calculated in this manner,
the following results are obtained:
 
At stage 1: P1 = 0.082 and P2 = 0.031 At stage 3: The rank order of P1
for N = 100,000: 3955 The rank order of P2 for N = 100,000: 1199
 
i.e., the probability is p = 2 x 0.012 = 0.024, that is, only 1/40,
instead of 1/5000.
 
(Of course, 1/40 is the probability if the headings BRNDN and DR MQYY
were the only ones checked. But this is not the case. In section E, we
will show that the final probability is actually much higher than even
this.)
 
In what follows, we explain the source of the differences between
these findings and those of McKay:
 
D. MCKAY'S SYSTEMATIC ERROR
 
1. A few months ago, (in a posting called "Equidistant Letter Sequences
in Genesis- A Report. Feb. 22, 1997), Dr. McKay noted that some of the
names in the list of Rabbis included in the experiment published in
Statistical Science, had the following property: their ELSs appeared
more often than their PLSs. This advantage may allow for easier
"successful" proximities with ELSs of other expressions. Suppose that
expression x possesses this advantage. If we calculate the proximities
between its ELSs and those of the series of expressions, w1, w2,.. wn,
we will receive an overabundance of "successful" results. This will
happen whether the other expressions are related to x or not. This
is a systematic error because we exploit a certain property of the
expression more than once.
 
This possible advantage connected to the number of ELSs of a word, may
be only one possible advantage that an expression may have. Regardless
of the origin of the advantage, let us call expressions with these
types of advantages "charismatic" expressions.
 
Naturally, besides charismatic expressions, some expressions may also
be "anti-charismatic". For instance, if the ELSs of an expression
appear less often than its PLSs. Such expressions will tend to produce
"failures" in measuring proximities with other expressions. In the
sample of the Rabbis, there were many names and appellations. Some may
be charismatic, and others may be anti-charismatic. One would expect
that these effects will balance each other out.
 
The permutations test proposed by Professor Persi Diaconis, as
described in the Statistical Science paper, is supposed to eliminate
any such residual effects of charisma for symmetry considerations. If
we assume that the success of the proximities of name x is due to its
charisma, then it should be just as successful with unrelated dates
to which it is paired as to related dates. Thus, the permutation test
should cancel the "charisma" (advantage) of individual terms.
 
However, Dr. McKay (in his Report) argued that the permutations test
may not have succeeded in canceling this effect. We thought Dr. McKay
was wrong, and that Professor Diaconis' permutation test was a good
one. Nonetheless, it occurred to us that to demonstrate to Dr. McKay
that he is incorrect, one could apply the test described in B1
(described in the above mentioned paper). In a response sent to him
(A Preliminary Analysis and Comments on the Report of New ELS Tests,
June 19, 1997. Posted on the internet), Professor Eliyahu Rips proposed
the following test (Stage 1 in section B above):
 
To repeat the experiment described in Statistical Science, with the
single variation that the names will be taken only as ELSs, while the
dates as ELSs and PLSs. This simple procedure cancels all possible
effects due to charismatic names.
 
When we carried out this experiment, it turned out that the results
improved. Whereas the ranking of the most successful statistic P4 was
originally four in a million, it now improved to one in a million. In
other words, the success of the Statistical Science experiment was
clearly not caused by this effect.
 
2. When one is conducting an experiment that involves a heading sample,
the need for caution cannot be overstated. If the expression being used
as a heading happens to be charismatic, then it will "succeed" with
many expressions, whether they are related or not.
 
It appears that Dr. McKay neglected to take into account his own
observation: he happens to have used charismatic expressions for
the heading in his sample, and seems not to have realized that the
"success" of his results is entirely artificial. Moreover, the
randomization technique Dr. McKay performed, instead of offsetting
the charisma, ignored it altogether. Indeed, as per the discussion
in section B above, the use of only ELSs for the headings and the
appropriate randomization technique, is sufficient to make the
artificial result of 1 in 5,000 into a real one of 1 in 40.
 
3 . In my opinion, there is an even more basic approach to the
prevention of errors of this sort, and it touches on the question of
the exact phenomenon being traced. Here is not the place to discuss
this subject at length. It will be covered in detail in my forthcoming
book.
 
One more remark: in the summer of 1985, we set out to define a
"distance" between two expressions, w and w'(a measure of proximity).
In general, each expression will be represented by several ELSs.
First, we define a "distance" between a particular ELS of w and a
particular ELS of w'. We then have two options: a) To sum all the
"distances" between ELSs of w and ELSs of w'. We call this option TOT.
b) To take only the best "distance" of all these distances. We call
this option BEST.
 
At the time, Professor Rips suggested that TOT would provide a more
stable measure, so we used it for all our experiments, including the
Rabbis experiments. This is the function   described in our 1994
Statistical Science paper, from which function c is derived.
 
Last year, I conducted several experiments (also described in detail
in my forthcoming book), using the BEST measure. I believe that BEST
avoids the problem of charisma, arising from ELSs appearing more
often than expected. It must be stressed, however, that BEST should
be considered only as an additional or complementary measure of
proximity, not as a replacement for TOT.
 
If we measure Dr. McKay's sample using BEST, after randomization
(section B, stages 2 and 3) the resulting p value is reduced even
further to p = 0.019 x 2 = 0.038, or roughly 1/25. In other words,
using BEST, there is no success for Dr. McKay's list.
 
E. SELECTION OF THE HEADINGS:
 
Until now, we did not take into account the number of headings used by
Dr. McKay. Basically, there are two possible ways of transcribing the
name "McKay." There is a transcription according to the written
characters and a phonetic transcription (which Dr. McKay claims he
prefers). There are also two ways to spell the name, either with a
single "Y" at the end or with a double "YY". It is a mistake to write
McKay with a double "YY". Nonetheless, since he used it, we will go
along with his choice.
 
1. Transcription according to the written characters: Dr. McKay uses
the Encyclopedia Hebraica for transcription. For example, he directs
us to copy the name QNBRH (Canberra) from there. In this encyclopedia,
in volume XXII, p. 965, (eighth line from the end of the page), one
finds two transcriptions beside each other: MQQLWQ for McCulloch and
MQY for Mackay. According to the encyclopedia, Dr. McKay should
properly spell his name: MQQY.
 
Moreover, Dr. McKay himself, in a paper describing Chanukah codes in
War and Peace, used the transcription MKQY (with a single "Y").
 
Dr. McKay also provides us with an explanation why he ignored the
very encyclopedia he used in his present article and why he is not
consistent in his usage of MKQY as before. He states that the only
time his name was mentioned in the Israeli press (the Israeli
newspaper Maariv), it was spelled MQYY. In truth, we think that the
transcription MQYY is a mistake altogether. A newspaper should not
be considered an authoritative source for spellings of transliterated
words. Nonetheless, since he used it, we will consider its use.
 
However, Dr. McKay's claim that his name was never otherwise mentioned
in the Israeli press is simply not true. In an article in the magazine
Mishpacha, published on the July 3, 1997, (page 13, Column 1, Line 14),
his name is spelled MQQYY. And if Dr. McKay should claim that he did
not know of this publication, it would be a strange claim  the
dedication at the head of his own article: LA.L. M$T"P )MYC (M HAMT
is based on that same piece in Mishpacha.
 
In summary, using the transcription according to written characters,
in view of previous publications by Dr. McKay, the sources he has
variously cited himself, and the fact that he sometimes uses one "Y"
and sometimes two, we have with the following eight variations: 
MKQY, MKQYY, MQQY and MQQYY, and with the D"R (Dr.) as a prefix:
DR MKQY, DR MKQYY, DR MQQY and DR MQQYY.
 
Of these, there are 5 expressions of 5-8 letters each, which appear
as ELSs in War and Peace: D"R MKQY, D"R MKQYY, MKQYY, D"R MQQY, MQQYY.
It should be remembered that according to the criteria established
in his email (page 1 above), Dr. McKay only wanted to use his surname,
but in fact, he used both his surname and the D"R prefix.
 
2. The phonetic transcription: According to the above-mentioned
Encyclopedia, the name should be spelled MQY with a single "Y".
This usage of one "Y" is how he himself spelled it in his paper
on Chanukah. Here he freely adopted MQYY with a double "YY".
 
By adding the prefix D"R, we have 2 expressions of 5-8 letters each:
D"R MQY and D"R MQYY.
 
In all, we have found 5+2=7 different expressions which may be
found as ELSs in War and Peace.
 
3. The transcription of his first name: Dr. McKay spells the name
Brendan: BRNDN, but in the same article of the Maariv newspaper
which is the source of his spelling of MQYY, indeed, at the same
place in the article, the name is written BRNDWN rather than BRNDN.
 
In summary, Dr. McKay thus appears to have tried at least 16 different
headings. According to his letter, he tried BRNDN separately and
according to his article he tried combinations. In other words, he
could have readily tried BRNDN separately and also BRNDN plus
each of the above 7 possibilities, i.e., a total of 8 headings. The
same goes for BRNDWN, which translates into a total of 16 different
headings. Please note that all of the above calculations are based on
Dr. McKay's own data and selections, and we have made no references to
other possibilities which may have been taken into consideration.
 
F. THE SELECTION OF THE EXPRESSIONS IN THE LIST:
 
Once Dr. McKay added QNBRH (Canberra, which is not the place of
birth) to the list, he revealed that he had an inestimable space of
biographical details to work with. Not only is it prohibited to include
QNBRH on the list, it also casts a dark shadow on the remainder of
the list.
 
Indeed, it is clear that Dr. McKay was interested in both the date
and place of his birth. Thus, his list of names and dates is itself
a partial list of the biographical details of his birth: it contains
only the date (day and month), and the year, but does not include
the place.  If he takes a partial group of the details, we have to
calculate how many sub-groups he could have taken, searching for
that cluster which offers the most impressive proximity. Dr. McKay
had three options available:
 
1. Dates stating only the day and month (K"W T$RY, etc.).
2. Dates stating only the year.
3. Options 1 and 2 combined.
 
G. SUMMARY:
 
If we compile all the selections Dr. McKay could have made for the
headings and for the list, it follows that even the most conservative
calculation leads to at least 16x3=48 different possibilities. In
fact, in his email, Dr. McKay reports the use of additional two
statistical analyses beyond P1 and P2, so the final probability 
obtained in section C (p=0.024) should be multiplied by at least
2x48=96.
 
In other words, the statistical significance of Dr. McKay's results
does not deviate at all from that expected for a random event.
 
Note: All the above calculations are based on the assumption that
the data supplied by Dr. McKay is accurate. We have reason to be
skeptical of its accuracy given the serious flaws in the data of his
earlier reports.
 
We have shown that Dr. McKay's success was in fact artificial.
For the sake of comparison, we will shortly publish an authentic
example of a heading sample.