NRLgate -
Plagiarism by Peer Reviewers

Sections 3.7 thru 3.12

This page is part of the NRLgate Web site presening evidence of plagiarism among scientific peer reviewers involving 9 different peer review documents of 4 different journal and conference papers in the fields of evolutionary computation and machine learning.

This page contains sections 3.7 through 3.12 of "Evidence of plagiarism in X and Y of a paper on optimal control strategies submitted to the Machine Learning Conference."

Go to top of NRlgate Home Page
Go to Abbreviated Table of Contents
Go to Detailed Table of Contents
Go to Complaint Letter to the Evolutionary Computation journal
Go to top of Previous Page
Go to top of Next Page

3.7. The unlikely coincidence that both reviewers X and Y employed the same quaint and infrequently used word in their reviews

Reviewer Y refers to,

previously used in machine learning studies.

(Emphasis added).

Reviewer X refers to

Previous studies, ...

(Emphasis added).

That is, both reviewers X and Y use the somewhat quaint and infrequently used word "studies" --- instead of a more ordinary word such as "work," "research," "literature," "paper," or "article."

How frequently do contemporary peer reviewers in this field of science use the word "studies"?

I was the general chair of the Genetic Programming 1996 Conference and have access to a computer file containing the 64,109 words of the 316 paper review forms from the 86 peer reviewers of the genetic programming papers at the GP-96 conference. Here is the frequency of occurrence of six alternative words contained in this corpus of writing by peer reviewers in this field:

article --- 11
literature --- 34
paper - 734
research --- 46
studies --- 1
work --- 220

Both reviewers X and Y of my MLC paper on optimal control strategies appear to belong a small minority of reviewers.

Given the infrequency of usage of this particular word by contemporary peer reviewers in this field, isn't it improbable that both reviewers X and Y of the same paper would simultaneously make this particular choice of words (in the space of short reviews containing a few hundred words each)?
One the other hand, if one peer reviewer had created his review while looking at an already written review of the same paper, the probability of this particular choice of words is no longer small.

This is not the only occasion when we encounter two peer reviewers of the same paper both belonging to the small minority of peer reviewers who use this particular quaint and infrequently used word. The infrequently used word "studies" appears 2 times in review #2 of my paper submitted to the Evolutionary Computation journal and 1 time in the signed, non-anonymous review to the MIT Press of my first book. See section 7.6.

3.8. The unlikely coincidence that both reviewers X and Y would abbreviate the paper's title and that they would both abbreviate it to the same 6 words

The title of my submitted MLC paper on optimal control strategies had 14 words.

Both reviewers X and Y abbreviated the paper's title. Moreover, they converged onto the same 6 words:

Genetic breeding of optimal control strategies

How common is the practice of abbreviating the author's title on a paper review form?

We again make reference to the computer file containing the 64,109 words of the 316 paper review forms from the 86 peer reviewers of the genetic programming papers at the Genetic Programming 1996 Conference.

There were 27 reviews of papers whose titles contained between 11 and 17 words (i.e., 14 words plus or minus 3). In 63% of the cases, the reviewer entered the author's full exact title onto the paper review form without modification. Thus, the probability of randomly drawing 2 reviewers who abbreviate the author's title is about 1 in 9.

More importantly, there are many different ways for abbreviating titles that were used by the minority of peer reviewers who modify an author's title. In the case of titles that were abbreviated by more than one reviewer, there was no case where two reviewers abbreviated a title to identical words (or even the same number of words). Moreover, no two reviewers even used the same approach to abbreviation. The different approaches included

using just the first 2 words of the author's title
using the first coherent phrase
using a coherent phrase from a later part of the title
dropping articles, such as "the" and "an"
abbreviating "genetic programming" to "GP"
abbreviating an especially long word
adding "etc." in lieu of deleted words
using ellipsis (3 dots) in lieu of deleted words
using a long dash in lieu of deleted words
leaving the "title" line entirely blank on the paper review form

Without the suggestive power of one already written review, isn't it improbable that reviewers X and Y would both abbreviate a title and that they would also converge to the identical shortened version?

We do not have sufficient data to compute the joint probability of convergent abbreviation; however, a rough guess is that it is around 1 in 100.

This is not the only occasion when I encountered two peer reviewers of the same paper both belonging to the small minority of peer reviewers who abbreviate the author's title and who coincidentally converge on the same abbreviation. As will been seen elsewhere, reviewers #1 and #2 abbreviated the 15-word title of my paper submitted to the Evolutionary Computation journal and they both converged to the same 8 words. See section 4.5.

3.9. The opening sentences of both reviews X and Y gratuitously provided the same unrequested information

The first section of the MLC paper review form asks the reviewer to evaluate the "significance" of the submitted paper.

Significance: How important is the work reported? Does it attack an important / difficult problem or a peripheral / simple one? Does the approach offer an advance. (Emphasis added).

Review X starts his review,

The papers presentsone example of using a genetic algorithm to learn control strategies for a version of thecart-and-polesystem.

(Grammatical error of "papers" in original).

(Emphasis added).

Review Y starts his review,

This paper concernsthe application of ideas from genetic algorithms to abroomstick balancingproblem.

(Emphasis added).

Notice how the first sentence of both reviews ignored the specific question concerning "significance" that was actually asked by paper review form. Instead, both reviews were unresponsive to the question being asked. They both began by gratuitously providing an unrequested summary of the subject matter of the paper.

The paper review form didn't ask reviewer X what the paper "presented" (or "concerned" to use reviewer Y's word). It asked about "significance."

How improbable is it for peer reviewers in this field of science to unresponsively begin their answer to a paper review form's question on significance with an unrequested summary of the paper?

We again make reference to the computer file containing the 64,109 words of the 316 paper review forms from the 86 peer reviewers of the genetic programming papers at the Genetic Programming 1996 Conference.

As it happens, the first substantive question on the paper review form used by the GP-96 conference concerned the "significance" of the paper.

Significance of the Problem: Is the subject of this paper important?.

98.4% (311) of the 316 review documents began by addressing the question that was actually asked by the paper review form. Only 1.6% (5) of these 316 review documents began by unresponsively and gratuitously providing a summary of the subject matter of the paper.

Both reviewers X and Y of my MLC paper on optimal control strategies appear to belong the small minority of reviewers with this unusual pattern of behavior.

Given the infrequency of this unresponsive behavior by contemporary peer reviewers in this field, isn't it improbable that both reviewers of the same paper would exhibit this particular unusual behavior unless one peer reviewer were plagiarizing his review from an already written review of the same paper.

This is not the only occasion two peer reviewers of the same paper both belonging to the small minority of peer reviewers who have the habit of unresponsively providing a summary of the submitted paper instead of the requested evaluation of its significance. As will been seen elsewhere, both reviewers A and B of my MLC paper on empirical discovery belong to this same small minority. See section 2.10.

3.10. Both reviewers X and Y substituted "genetic algorithm" in lieu of the author's chosen term

Review X begins his review for my paper on genetic programming (GP) is as follows:

The papers presentsone example of using a genetic algorithm to learn control strategies for a version of the cart-and-pole system.

(Grammatical error of "papers" in original).

(Emphasis added).

Review Y begins his review for my paper on genetic programming (GP) is as follows:

This paper concernsthe application of ideas from genetic algorithms to a broomstick balancing problem.

(Emphasis added).

Notice that both reviewers X and Y imposed this substitution of "genetic algorithm" for the author's chosen term (perhaps offensive to both) that actually appears in the submitted paper.

This is not the only occasion when we encounter two peer reviewers of the same paper both belonging to the small minority of peer reviewers who impose their own term in this manner in lieu of the author's chosen term. As will been seen elsewhere, both reviewers A and B of my MLC paper on empirical discovery have this same propensity. See Section 2.11. See section 5.4.

3.11. The opening sentences of both reviews X and Y were similar in structure

Notice the overall semantic symmetry of the opening sentences of the reviews X and Y.

Both reviewers gratuitously and unresponsively summarize the subject matter of the paper.
Both reviewers are briefly complimentary ("important" and "better than").
Both reviews then converge to the same complaint ("highly constrained" versus "limited").

Review X begins,

The papers presentsone example of using a genetic algorithm to learn control strategies for a version of the cart-and-pole system. The problem of learning non-linear control strategies is an important one, but the particular problem addressed here is a highly constrained case.

(Grammatical error of "papers" in original).

(Emphasis added).

Review Y begins,

This paper concernsthe application of ideas from genetic algorithms to a broomstick balancing problem. The genetic search process found a control algorithm that solved the problem better than a suboptimal control strategy designed by the second author. The control algorithms found by the genetic search were represented as Lisp S-expressions from a limited set of functions and atoms.

(Emphasis added).

3.12. The time sequence of the plagiarism is suggested because reviewer Y correctly identified the 3 dimensions of the problem a mere 2 sentences before he complained that the paper failed to properly identify them

The previous sections showed how both reviewers X and Y proceeded in lock-step by

their joint abbreviation of the paper's title to the same 6 words
their joint unresponsive and gratuitous substitution of a summary for the requested comments about "significance"
their joint use of the term "genetic algorithm"
the semantic symmetry of their opening sentences
the demonstrably erroneous joint complaint that the submitted paper failed to properly identify the 3 dimensions of the problem (Section 3.2 and (Section 3.5)
the joint use of quotation marks around "three dimensional broom balancing problem" (Section 3.3)
the semantic similarity of the joint complaint about the 3 dimension as a mismatch between the paper's title and contents (and not as an omission of pertinent information) (Section 3.4)
the almost identical 3-part sentence in which the same thoughts appear in the same order using almost identical words (Section 3.1)
the joint mistaken claim concerning the existence of a "standard" version of a particular problem (Section 3.6)
their joint use of the infrequently used word "studies"

All first 10 sentences of review Y are now shown below. Sentences 6 and 7 (in bold) are the end of reviewer Y's gratuitous "summary" of the subject matter of the paper. Notice that reviewer Y, correctly identifies the 3 dimensions in sentence 6. Sentence 8 is the almost identical 3-part explanatory sentence common to reviews X and Y that was discussed above. Sentence 9 (in bold) is the complaint that the submitted paper failed to properly identify the 3 dimensions of the problem.

This paper concerns the application of ideas from genetic algorithms to a broomstick balancing problem. The genetic search process found a control algorithm that solved the problem better than a suboptimal control strategy designed by the second author. The control algorithms found by the genetic search were represented as Lisp S-expressions from a limited set of functions and atoms. Whether they evaluated to a positive or to a negative quantity was used to determine the bang-bang force applied in each control situation. The standard broomstick balancing physics was used, but with a new control objective not previously used in machine learning studies. The goal was to drive the system in minimum time to a near-zero value for the cart velocity, pole angle, and pole angular velocity. Cart position was apparently ignored. The broomstick balancing problem was the standard two-dimensional one in that the cart moved only along a one dimensional track and the pole could swing only forward and back, not right to left. It is not clear what the authors mean by calling their problem "the three dimensional broom balancing problem." The authors do not discuss why they use a broom balancing problem so different from that used by previous machine learning researchers.

(Emphasis added).

As can be seen, reviewer Y correctly identified the 3 dimensions of the problem in his sentence 6, namely

the cart velocity, pole angle, and pole angular velocity. (Emphasis added).

What is happening here?

Apparently, reviewer Y initially followed reviewer X's lead by beginning his review with a gratuitous (and larger) "summary" of the subject matter of the paper. Reviewer Y's 7-sentence summary is reasonably accurate. Then, starting with sentence 8, reviewer Y apparently shifted gears and reverted to reviewer X's already written review for guidance. In sentence 8, reviewer Y paraphrased the previously discussed 3-part explanatory sentence using almost identical words as reviewer X. Then, reviewer Y apparently continued under the spell of reviewer X's already written review in sentence 9. Even though reviewer Y had himself just accurately identified the 3 dimensions of the problem (sentences 6 and 7), reviewer Y parroted (in sentence 9) the complaint of reviewer X:

It is not clear what the authors mean by calling their problem "the three dimensional broom balancing problem."(Emphasis added).

Not clear?

Apparently, this entire process of plagiarism was so thoughtless mechanical that reviewer Y never thought about the absurdity of what he was typing.

Finally, reviewer Y continued under the spell of reviewer X and reworded reviewer X's complaint about the alleged non-standardness of the problem (sentence 10).

The contradiction between reviewer Y's sentences 7 and 9 strongly suggests the time sequence of the plagiarism --- namely, that reviewer Y plagiarized from reviewer X. Even though reviewer Y fully and correctly grasped the 3 dimensions of the paper (as shown by his own sentence 7), reviewer Y was apparently so preoccupied with the mechanical task of parroting reviewer X's already written words that he missed the contradiction between what he himself typed in sentences 7 and 9. This inadvertent error by reviewer Y not only shows that his review was derived from reviewer X's already written review, but it strongly suggests the time sequence of the plagiarism. If the plagiarism occurred in the other order, this internal contradiction would probably have shown up in reviewer X's review.

Author: John R. Koza
E-Mail: NRLgate@cris.com

NRLgate - Plagiarism by Peer Reviewers

Sections 3.7 thru 3.12

3.7. The unlikely coincidence that both reviewers X and Y employed the same quaint and infrequently used word in their reviews

3.8. The unlikely coincidence that both reviewers X and Y would abbreviate the paper's title and that they would both abbreviate it to the same 6 words

3.9. The opening sentences of both reviews X and Y gratuitously provided the same unrequested information

3.10. Both reviewers X and Y substituted "genetic algorithm" in lieu of the author's chosen term

3.11. The opening sentences of both reviews X and Y were similar in structure

3.12. The time sequence of the plagiarism is suggested because reviewer Y correctly identified the 3 dimensions of the problem a mere 2 sentences before he complained that the paper failed to properly identify them

NRLgate -
Plagiarism by Peer Reviewers