NOTE: This transcription was contributed by Martin P.M. van der Burgt, who has devised a process for producing transcripts automatically. Although its markup is incomplete, we believe it serves a useful purpose by virtue of its searchability and its accessibility to text-reading software. It will be replaced by a fully marked-up version when time permits. —HR
Copyright Notice | |
The following manuscript | |
EWD 538 A collection of beautiful proofs : | |
is held in copyright by Springer-Verlag New York. | |
The manuscript was published as pages 174–183 of | |
Edsger W. Dijkstra, Selected Writings on Computing: A Personal Perspective, | |
Springer-Verlag, 1982. ISBN 0–387–90652–5. | |
Reproduced with permission from Springer-Verlag New York. | |
Any further reproduction is strictly prohibited. |
A collection of beautiful proofs.
This chapter contains a compilation of beautiful proofs, proofs of which I expect that all mathematicians will agree that they are beautiful. The purpose of this compilation is to collect the material that may enable us to come to grips with the main qualities that together constitute “mathematical elegance”. Further analysis and comparisons of these gems will be postponed until the collection is thought to be large enough. In order to avoid too much of a personal bias (and, also, to build up a larger collection than I could think of myself) I have asked others for their contribution to the collection. The only constraint was that the proof could be appreciated by the “generally educated”; all contributions that required specialized mathematical knowledge had, alas, to be rejected.
1. A classical example.
In the late 18th century a German schoolmaster gave —with the intention of keeping his pupils busy for another hour— the task to sum hundred terms of an arithmetic progression to a class of little boys who, of course, had never heard of arithmetic progressions. The youngest pupil, however, wrote down the answer instantaneously and waited gloriously, with his arms folded, for the next hour while his classmates toiled: at the end it turned out that little Johann Friederich Earl Gauss had been the only one to hand in the correct answer. Young Gauss had seen instantaneously how to sum such a series analytically: the sum equals the number of terms multiplied by the average of the first and the last term. (To quote E.T.Bell: “The problem was of the following sort, 81297 + 81495 + 81693 + ... + 100899, where the step from one number to the next is the same all along (here 198), and a given number of terms (here 100) are to be added.”)
In two respects this is a classical example: firstly young Gauss produced his answer about a thousand times as fast as his classmates, secondly he was the only one to produce the correct answer. So much for the effective ordering of one’s thoughts!
2. The Pythagorean Theorem, proof I.
When I was twelve years old, I learned the following proof, in which
a square with sides a + b is considered in two different ways.
The two expressions are different expression for the same area: they are therefore equal. Next we observe 2ab = 4ab/2 and by subtraction we find a2 + b2 = c2. A beautiful proof in the good old Greek tradition that fascinated me when it was shown to me, and satisfied me for more than 30 years.
3. The Pythagorean Theorem, proof II.
The following proof was shown to me a few years ago. The areas of similar figures have the same relation as the squares of corresponding lines; for three similar figures with areas A, B, and C respectively and corresponding lines a, b, and c respectively, any homogeneous linear relation satisfied by A, B, and C is, therefore, also satisfied by a2, b2, and c2, and vice versa. In particular we know that A + B = C implies a2 + b2 = c2 .
Here we have three similar triangles with a, b, and c respectively as their hypotenuse; the sum of the areas of the first two equals the area of third triangle, i.e. A + B = C, hence a2+ b2 = c2.
4. The Theorem of Pompeiu.
For a triangle ABC of which at least two sides have different lengths, we can choose a point P such that the lengths AP , BP , and CP are such that no triangle can be formed from those three pieces.
This observation led the Rumanian mathematician Pompeiu to the conjecture that, conversely, for a equilateral triangle ABC no such point exists, i.e. that for every point P the lengths AP, BP, and CP satisfy the triangular inequalities. He gave a proof, which —I am told— was very ugly. The following beautiful proof is due to E.R.Veldkamp; it gives a constructive existence proof of such a triangle with sides equal to AP, BP, and CP respectively.
5. Euclid’s Theorem on Primes.
Denoting the integer numbers ≥ 2 by the term “multiples”, we can define the primes as those multiples that cannot be written as the product of two multiples. From this definition it follows immediately that for each multiple there exists at least one prime dividing that multiple.
Let P be a prime; define the multiple Q as the product of all primes ≤ P , increased by 1. The multiple Q has been constructed in such a way, that none of the primes ≤ P devides Q ; the prime dividing Q must, therefore, be > P . Hence there is no largest prime number.
Note. It is not unusual that, after the construction of Q, the proof considers the two cases “Q is a prime” and “Q is not a prime” separately. The above proof shows that this case analysis is superfluous; the case analysis has probably been induced by the linguistic distinction between singular and plural forms. (End of note.)
6. Euclid’s Theorem on the Base Angles of an Isosceles Triangle.
Using the theorem that any two triangles which have two sides and the included angle equal to two sides and the included angle of the other are congruent, it should be proved that the base angles of an isosceles triangle are equal, more precisely, that from AC = BC follows that the angles A and B are equal to each other.
Note. It is not necessary —as Euclid seems to have done— to bisect angle C and then to use the theorem to show that the original trangle is cut into two congruent parts.(End of note.)
7. A covering problem.
Given the figure as shown below that could be covered by 138 squares, and 69 dominoes of two squares each —one such domino is shown below—
Consider the 10*14 rectangle before the two opposite squares have been removed, and colour its squares alternatingly black and white as wit a chess board: the rectangle then shows 70 white squares and 70 black ones. The two squares to be removed have, however, the same colour, and our figure, therefore, has 70 squares of the one colour and 68 squares of the other colour. Each domino covers one white and one black square; together the dominoes cover, no matter how they are placed, 69 black and 69 white squares. As a result they cannot cover the given figure.
8. The Harmonic Series Diverges.
Consider
Sn=1/1 + 1/2 + 1/3 + 1/4 + 1/5 + 1/6 + 1/7 + 1/8 + ... + 1/n .
It has to be shown, that by choosing n sufficiently large, we can achieve
Sn > M for arbitrarily large value M; in other words we have to show that
the sequence S1, S2, S3 is unbounded. We observe that
S2 - S1 = 1/2
S4 - S2 = 1/3 + 1/4 > 1/4 + 1/4 = 1/2
S8 - S4 = 1/5 + 1/6 + 1/7 + 1/8 > 1/8 + 1/8 + 1/8 + 1/8 = 1/2 etc.
In other words: starting with n = 1 , Sn is increased by at least 1/2
each time n is doubled.
9. The Eigenvalues of a Hermitean Matrix are Real.
A Hermitean matrix is the generalization of a real, symmetric matrix; its transpose equals its complex conjugate
AT = A* | (1) |
A.x = lambda.x | (2) |
Taking the transpose of both sides of (2) we get
xT.AT = lambda.xT |
xT.AT.x* = lambda. xT.x* . | (3) |
Taking the complex conjugate of both sides of (2) we get
A*.x* = lambda*.x* |
xT.A*.x* = lambda*.xT.x*. | (4) |
On account of (1) we conclude that (3) and (4) have equal left-hand
sides, and hence 0 = (lambda - lambda*).xT.x*.
Because x is a non-null vector and xT.x* is a sum of absolute values,
we conclude that xT.x > 0 , and hence
lambda = lambda . | Q.E.D. |
10. The Cauchy-Schwarz inequality.
Let a1,..., an and b1,..., bn be 2n real numbers; then the
following inequality holds:
(a1b1 +...+ anbn)2 ≤ (a12+...+ an2)(b12+ ... + bn2)
Consider the following quadratic form Q(x) in x , defined by
Q(x) = (a1 + b1x)2+...+ (an + bnx)2 .
Because for real x, Q(x) is defined as the sum of the squares of n
real numbers, for real x the inequality Q(x) ≥ 0 must hold. In other
words, the equation Q(x) = 0 has at most one real root, and its discriminant
is ≤ 0 . Collecting powers of x in the definition of Q(x) we find:
Q(x) = (a12+...+ an2) + 2(a1b1+...+ anbn).x + (b12+...+ bn2).x2
with the discriminant
(a1b1 +...+ anhn)2 - (a12+...+ an2)(b12 +...+ bn2)
The conclusion that this discriminant is non-positive proves our inequality.
11. Reconstructing an odd polygon from the midpoints of its sides.
We shall show the construction for poly = 5 .
For the pentagon ABCDE , the points marked AB, BC. CD, DE, and EA respectively are the midpoints of its successive sides. Given the positions of those five midpoints, it is requested to reconstruct the original pentagon ABCDE.
Consider what happens when we subject a plane to five successive rotations of 180 degrees each with AB, BC, ED, DE, and EA as the successive centers of rotation. The point that originally coincided with A , coincides with B after the first rotation, with C after the second rotation, etc. and coincides again with A after the fifth and last rotation. Because the pentagon has an odd number of sides, the total transformation of that plane is therefore a rotation of 180 degrees with A as its center of rotation.
We now trace a point in the rotated plane that originally coincides with an arbitrary point X0 . Rotating it around AB gives us its position X1 after the first rotation, rotating that around BC gives us its position X2 after the second rotation, etc. until we have constructed its final position X5 . As that could also have been reached by rotating X0 ever 180 degrees around the —still unknown— point A , we conclude that A is the midpoint of the line from X0 to X5 ! The positions of the other four vertices B, C, D, and E now follow trivially.
12. The number of factors p for p prime in n!
Let n be a natural number, let p be a prime number; let s(n, p) denote the sum of the digits of the representation of n in the number system with radix p . Then the number of factors p in n! equals
n - s(n, p) | ||
----------- | (1) | |
p - 1 |
13. Frank Morley’s Theorem.
The adjacent pairs of the trisectors of the angles of a triangle always meet at the vertices of an equilateral triangle. |
Draw an equilateral triangle XYZ and construct the triangles AXY and BXZ with the angles as indicated in the above picture. Because ∠AXB = 180̊ - (α + β), it follows that if ∠BAX = α + φ, ABX = β - φ . Using the rule of sines three times (in triangles AXB , BXZ , and AXY ), we deduce
sin(α + φ) | BX | XZ.sin(60̊+γ)/sin(β) | sin(α) | ||||
----------- | = | ---- | = | ----------------------- | = | ------- | |
sin(β - φ) | AX | XY.sin(60̊+γ)/sin(α) | sin(β) |
Because in the range considered the left-hand side of this equation is a monotonically increasing function of φ, we conclude that Q = 0 is in this range its only root. Completing the picture and repeating the argument twice we conclude that the angles at A, B, and C are trisected, and thus Marley’s Theorem is proved without the aid of any additional lines.
(To be continued in a later report.)
Transcribed by Martin P.M. van der Burgt
Last revision