Borwein - Pi and The AGM (Wiley, 1987)
Borwein - Pi and The AGM (Wiley, 1987)
Borwein - Pi and The AGM (Wiley, 1987)
JONATHAN M. BORWEIN
PETER B. BORWEIN
Department of Mathematics
Dalhousie Urtiversity
Halifm, Nova Scotia
A Wiley-Interscience Publication
JOHN WILEY & SONS
NewYork
Chichester
Brisbane
Toronto
Singapore
8979323846
3282306647
6659334461
0631558817
5759591953
2643383279
0938446095
2847564823
4881520920
0921861173
5028841971
5058223172
3786783165
9628292540
8193261179
6939937510
5359408128
2712019091
9171536436
3105118548
5820974944
4811174502
4564856692
7892590360
0744623799
5923078164
8410270193
3460348610
0113305305
6274956735
0628620899
8521105559
4543266482
4882046652
1885752724
8628034825 3421170679
6446229489 5493038196
1339360726 0249141273
1384146951.9415116094
8912279381 8301194912
9833673362
0005681271
4201995611
5024459455
5982534904
4406566430
4526356082
2129021960
3469083026
2875546873
8602139494
7785771342
8640344181
4252230825
1159562863
6395224737
7577896091
5981362977
3344685035
8823537875
1907021798
7363717872
4771309960
2619311881
9375195778
6094370277
1468440901
5187072113
7101000313
1857780532
0539217176
2249534301
4999999837
7838752886
1712268066
2931767523
4654958537
2978049951
5875332083
1300192787
8467481846
1050792279
059731733
8142061777
6611195909
7669405132
6892589235
1609631859
7669147303
2164201989
3809525720
5574857242
8583616035
9331367702
6782354781
1065485863
4541506959
6370766010
8989152104
6360093417
2788659361
5082953311
4710181942
7521620569
2164121992
5338182796
6861727855
9555961989
6602405803
4586315030
8230301952
8890750983
4676783744
8150193511
2861829745
0353018529
8175463746
9448255379
2533824300
5570674983
6899577362
4939319255
7747268471
3558764024
8505494588
2599413891
0604009277
0404753464
7496473263
5869269956
2497217752
0167113900
6208046684
9141992726
9092721079
8347913151
9848824012
2590694912
0426992279
7509302955
3211653449 8720275596 0236480665 4991198818 3479775356 6369807426 5425278625 5181841757 4672890977 7727938000
8164706001 6145249192 1732172147 7235014144 1973568548 1613611573 5255213347 5741849468 4385233239 0739414333
4547762416 8625189835 6948556209 9219222184 2725502542 5688767179 0494601653 4668049886 2723279178 6085784383
igTz$W$gtS
~ ~ $ 7&?$WtR$
~ ~ ~ ~RtLWW!$ $~~~~$P%$~
lb3$f$2$13 atJ3zi$3$$ ~ , 5 $ $3+:GWOb+
Xo
:= 2'12
48$27%i;$$: $b8$$;6lW
Preface
vii
Preface
ix
-1
-6
Complete Elliptic
Integrals and the
Arithmetic-Geometric
Mean Iteration
The Complexity
of Calculating
Algebraic Functions
J.
J.
-2
7
The Complexity
of Pi and the
Elementary Functions
Theta Functions,
The AGM
and
Algorithms for Pi
r
J.
10
Other ~ G r o a c h e s
to the
Elementary Functions
&\
Jacobi's TripleProduct,
Theta Functions and
Number Theoretic
ADDli~ati~n~
Modular Equations
a
n
Transformation
Theory of
Elliptic lntegrals
5
Modular Equations
and Algebraic
Approximations
to Pi
--
General Means
Mean Iterations
11
Fu"l
Applications of
Theta Functions
The Story of Pi
Computation
and
Transcendence
---+
Necessary Reading Order
--
.
) Possible Reading Order
viii
Preface
Biographical Information
Contents
Chapter 1
1.1
1.2
1.3
1.4
Chapter 2
3.1
3.2
3.3
3.4
xiii
33
xiv
Contents
3.5
3.6
3.7
Chapter 4
~r
Chapter 6
6.1
6.2
6.3
6.4
Chapter 8
8.1
8.2
8.3
8.4
Contents
8.5
8.6
8.7
8.8
Chapter 9
9.1
9.2
9.3
9.4
9.5
281
Chapter 10
Pi
rr
337
316
Chapter One
1.1
(1.1.3)
c , + ~ : =$(a,- b,).
b, r b,+l
5 a,+l 5 a,
and
We will refer to this limiting process as the AGM and usually reserve the
symbols a,, b,, and c, for variables bound by the AGM relations. We also
reserve the symbol AG(a, b) for the common limit.
We say that a, + a ,with pth-order convergence if
-a
- a ) p = O(l)
an+1
(a,
where, as usual, a, = O(j3,) means that, for some constant c and for all n,
a,,r cp,. If the a, are functions defined for all x in a set K, and if the
implicit constant concealed by the 0 symbol in (1.1.6) is independent of x ,
then we say that the convergence is uniformly pth order. Roughly speaking,
quadratic (second-order) convergence doubles the number-of-digits agreement between successive iterates and the limit, cubic (third-order) convergence triples the agreement, and so on.
It is an easy observation from (1.1.4) that the AGM converges uniformly
quadratically for a,, b, restricted to compact subsets of (0, m). Very precise
estimates for the rate of convergence will be established later.
We also observe that M(a, b) is homogeneous, that is, for A > 0
(1.1.7)
and thus there is little loss of generality, though often some loss of
symmetry, to setting a = 1.
The function M satisfies
The analysis of the limit of the AGM rests on finding a two-variable function
M invariant under the transformation (1.1.8) or, equivalently, on finding a
function f satisfying the functional relation (1.1.9), namely,
If we set k, := x E ( 0 , l ) and
m.
an=an+l+cn+l
bn=an+l-cn+l
c 2n = a2n - b : .
a _ , = 2"a:
b-, =2 " ~ :
C-,
= 2"b:
where a:, b: and c: are generated from the AGM commencing with
a: := a,, bg := c,, and c: := b,. Show that
2.
Show, for a,, Po E ( 0 , w), that the above iteration converges quadratically to H(ao, Po), where
H(a07
1
M ( l la,, 1 I&)
'
Po E ( 0 , w), that
4.
agreed to at least eleven decimal places. He commented in his diary that this
result "will surely open up a whole new field of analysisv-a claim vindicated by the subsequent directions of nineteenth-century mathematics. The
inverse of the above (indefinite) integral is the lemniscate sine, a function
Gauss studied in some detail. He had recognized it as a doubly periodic
function (see Section 1.7) by the year 1800 and hence had anticipated one of
the most important developments of Abel and Jacobi: the inversion of
algebraic integrals.
We now outline Gauss's derivation of the limit of the AGM (Gauss
[1866]). This is not the easiest development but it may be the most
motivated.
Theorem 1.1
Second proof.
Let
The two basic integrals we will encounter are the complete elliptic integral of
the first kind,
The complementary integrals E' and K' are the integrals in the complementary variable k' = I&-?,
For the complete elliptic integrals we have the series expansions, for
lkl<l,
where (2i - I)!!:= 1 . 3 - 5 - (2i - 1). The derivations are left as Exercise 1.
We have, as in Exercise 2 of Section 1.2, that K and K t are both solutions
of
This equation has a regular singular point at zero (the roots of the indicia1
equation are both 0 ) which, as the reader familiar with the elementary
theory of second-order differential equations knows, says that
(1.3.9)
K 1 ( k )= a log k K ( k ) + f(k)
(1.3.11)
E t ( k )= 1 +
[log
(i)
A]
-
k2
dE
-=dk
E-K
k
10
dE
-=dk
E-K
k
4.
9%
b) K1(k)= log(;)
+ 0(k2110gkt)
k J 0.
Hint:
d0
v1- (1 - k2)sin28
k' sin 0 d0
v k 2 + (kt)' cos2 8
+
[I2
~~
1+ kt sin 0
The second integral evaluates to log [(I + k') lk]. (See Borwein and
Borwein [84a].)
5.
6.
d2y
i ( 1 - z) 7
dz
1.4
12
Theorem 1.2
For k E (0,I),
(upward)
2
(b) K(k) = 1+ k'
1- k'
id
(downward)
(upward)
1- k'
- klK(k)
(downward) .
Proof. Parts (a) and (b) are equivalent and follow from the previous
discussion. We give a direct proof of (b) based on Ivory [1796]. This is also a
third proof of Theorem 1.1. Let 1:= (1 - k')l(l + k'). Then we have
V? = kl(1 + k') and 1+ 1' = 2(1+ k") / ( I + k')'. Thus
Here we have used the binomial theorem twice. Since only the terms with
m = n are nonzero, we have
on using (1.3.6). This completes (b). If g(k) := 22/lt/(l+ k), then g-l =
(1 - k') l(1 + k') and [I + gl(k)] I2 = 1/ ( I + k). Now (a) follows by substituting g(k) for k in (b). To derive part (c), we differentiate (a) to get, for
K = dKldk,
(1.4.2)
and
Theorem 1.3
Proof. Part (a) has been observed in the second proof of the fundamental limit theorem (Theorem 1.1).
To see part ( b ) , notice that if k , := c,la,, then k; = b,la,,
and
J(an,b , ) = a n E ( d l Recall that c: = a:
- b:.
5)
an
anE(kn).
14
(a) ~ ' ( k , )=
2
qn*2 .=,
fi (1 + k')
1+ k,
"
(upward iteration)
=
,,
where
and
k, E (0, 11 .
2
-2
1+ k:,
n (1 + k,)
.=,
2fi
knfl := 1+ k,
(b) K(k,)
;.=n,
"
(downward iteration)
where
1- k:,
k,+, := 1+ k:,
and
k, E [O,1) .
( b ) E ( k ) = ( 1 - C 2"-lc:) ~ ( k ) .
n=O
Proof. Part (a) is Theorem 1.1. For part ( b ) we use Theorem 1.3,
2J(an.+1,b,+J
- J(a,,
bo)
2
and since 4a,+,
- 2a,2 - 2a,bn = -cn,2
2n+1[~(an+17
b,+d - a:+14a,, boll - 2"[J(an7b,) - a:%,
n-1
=2
cnI(ao,b,)
b0)l
Thus on summing
m
d8
since c: tends to
~0
16
a.
2.
and hence
Also
and
4.
Po:=
18
and
and
d)
F(+,, k,) =
(n-1+
"
.=o 1 k,
log tan
($ + 51 lim 4,) .
n-+m
where c := [ b m + a m . -] I
This result, which dates
from 1753, is, according to Birkhoff [73], the "first notable theorem
about elliptic integrals."
1.5 JACOBI'S DIFFERENTIAL EQUATION AND A FOURTH PROOF
OF THE FUNDAMENTAL LZMZT THEOREM
Equivalently
G(k) := k 1 ' 2 k t ~ ( k ) and
satisfy
G*(k) := k 1 ' 2 k t ~ t ( k )
19
Also, the functional relations [(Theorem 1.2(a) and Exercise la) of Section
1.41 become
2fi
(1.5.2)
and
G*(k) :=*(I
(1.5.3)
+k
2fi
) J Z G*(m)
Theorem 1.4
Suppose that f, g, and a are all in c2(0, 1) and that g maps [O,1] into 10, I].
If, for x E (0, I),
and
(b) f(x) =
f( g(x))
c a constant
then
Proof.
d2f(g)
df( g)
f=(2$g+pg)-+$f(g)+g2p
7
ds
dg
where p :=
(remember that d! is the derivative with respect to x). Also,
from (a) and (b),
(1.5.6)
fff=
fi
(1.5.7)
f = af = apf(g)
20
Suppose G ( x ) := X " ~ X ' K ( X )If. p and g are algebraic functions that map
[O, 11 into [0, 11 and
where
-(7)
1 1+x2
.
4x2 l - x
Proof. We may assume that G ( x ) does not satisfy any equation of the
form
r(x):= -
21
and
Then
2vT
g(x) := 1+ x
then so is
flg1W
and
~/m
If we multiply by
and take the limit as x-+ 0, we deduce that c = 1.
We have thus proven that G satisfies equation (1.5.4). Equation (1.5.4)
transforms into
23
and that
and that
Suppose also that f does not satisfy a linear differential equation with
algebraic coefficients of order less than 3. Show that
ax
+b
id4 = ,d
and
The four quantities K, E, K', and E' are related by a remarkable relation.
24
E K r + E'K- K K r = c .
It remains to evaluate the constant c. This may be done directly from the
series expansion at zero (see Section 1.3, Exercise 5), or by using Exercise 4
of Section 1.3.
We define the gamma and beta functions by
(1.6.4)
T(x) :=
low
re(x) > 0
e-' tx-' dt
~ ( x ) ~-(x)l = -.
sin wx
These standard results will be discussed further in Section 3.6. Our present
objective is to evaluate K(l / f i ) and E(l l f i ) .
Theorem 1.7
and
Proof.
Finally, by (1.6.6),
and
fin
26
Values of k for which K'IK is a rational surd are of considerable interest and
importance. (See Chapter 4.) Such k are called singular values, the simplest
of which is l l f i , where
Start by writing
Then set u :=
1.6
2.
Legendre's Relation
Show that
3. Show that
lo
1 e-x2(t2+1)
d t
and
g(x) :=
t2
dt
- 1 is a
[In fact ~ ( f i - 1) = (fi
1)1'2,r($)~(~)/213'4~1'2.]
Thus
second singular value. We return to both singular values and r function
evaluations of K subsequently.
Establish Legendre's relation directly from (1.3.12) and (1.3.13) applied
to E, E', K, and K'. Use Exercise 4 of Section 1.3 to determine the
constant.
From the general theory of the Weierstrass function developed in the
next section, or more directly, one can show (Whittaker and Watson
[27, p. 5161) that for a, b > 0,
a) If g2 = 0, then
a2 + b2 = 2fi12g31113
a2 - b2 = -3(2g3)1'3
28
c)
and
The study of elliptic functions began in the 1820s when Abel and Jacobi
independently discovered that the inverses of elliptic integrals are doubly
periodic functions. As noted earlier, they had been anticipated by Gauss
who at the turn of the century had studied a particular elliptic function, the
lemniscate sine. This work, however, had not been published. This was one
of the critical discoveries of the era and was vital to the concomitant
development of complex analysis and the later development of modular and
automorphic functions.
29
Except for minimal application in Sections 2.6 and 2.7, we have no great
need for this body of material and offer only a brief sketch of this attractive
theory, primarily in the exercises at the end of this section. Details may be
found in Whittaker and Watson [27] and, more recently, in Bowman [53],
Du Val [73], Eagle [58], and Lang [73].
An elliptic function is a meromorphic function with two periods w, and
w,, with im(w,lw,) # 0. That is, for all z, f(z) = f(z + w,) = f(z + w,). We
assume that w, and w, are minimal in the sense that they are not multiples
of any smaller period. The function f is completely determined by its
behaviour on any parallelogram that is a translate of the parallelogram with
vertices 0, w,, w,, and w, w,. Any such parallelogram is called fundamental. Associated with the periods w, and w, there is the lattice L := {nw, +
mw,)m, n integers). It transpires that given any lattice L there exist elliptic
functions with the appropriate associated periods. However, there are not
many such functions. As we shall indicate in the exercises, any two such
functions are connected by an algebraic equation. The circular functions
may be thought of as degenerate elliptic functions (one of the periods is
infinite), as may rational functions (both periods infinite).
A function f is said to have an algebraic addition theorem if there exists a
polynomial R in three variables with complex coefficients so that for
x, Y E@,
(1.7.2)
U=
dt
V(1- t2)(kf2+ k2t2)
and
30
sn(u, 0) = sin u
and
sn(u, 1) = tanh u
where the product is extended over the zeros off, choosing only one
representative from each pair (z, - z mod L ) and where Si is the
multiplicity of the zero (except in the case zi = -zi mod L, in which
case 6,is half the multiplicity). Now shqw that f and g have the same
zeros with the same multiplicities. Treat the poles similarly.
Show that any elliptic function f is a rational function of p and p.
Hint: Consider
Hint: Consider the order of the pole ofp2 - 4p3 at zero. Observe that
g2 and g, depend on L. Different lattices lead to uniformizations
(parametrizations) of different cubics.
Show that any two nonconstant elliptic functions f and g (with respect
to the same lattice) satisfy an algebraic equation
Chapter Two
34
63(q) + 64(q) = 2 n C
qn2=283(q4>.
even
Also
k,m= -m
k+m odd
+ 1, gives
Hence
~ ( 1k,t ) = e;'(q)
for
k t = e:(q)lei(q)
In particular,
Proof. This follows from the previous discussion and Theorem 1.1. The
uniqueness of q will be obvious from the results of Section 3.1, which will
show that 0, decreases and 0, increases on (0,l). Cl
The results of Theorem 2.1 remain true more generally in the complex
plane. This is discussed in Exercise 4.
Comments and Exercises
The solution of the AGM in theta function terms can be found in Gauss
36
1. a) Let f : Z x
2.
for such q. [That 8Jq) does not vanish will be apparent from Section
3.1.1
2.2 POISSON SUMMATION
A most analytically accessible route to the behaviour of the AGM lies in the
Poisson summation formula, which we now describe. We then give some
examples of its use before returning in the next section to its relationship
with the AGM. The formula we need is:
2.2
Poisson Summation
37
Theorem 2.2
2.1.
n>-x
If x := 0, we derive
which yields
on replacing y by 2nx.
If x:= 1, we derive
xZ
otherwise
'
38
F(+)
where
lo
m
(2.2.4)
F( y ) :=
z,-
e-x2cos (2xy) dx = - e
2
Again, analyticity considerations show that (2.2.5) holds for re(s) > 0. This
is a general form of the theta transformation formula. For future reference
we make the notational agreement that Oi(s):= Oi(e-"") and observe that for
x := 0 , (2.2.5) can be written as
Note that for large s the sum t/S03(s)converges much more rapidly than
03(s-I). For example, if s := 100, 03(0.01) and 10 + 20e-loo" coincide
through more than 500 digits.
Comments and Exercises
The result of (2.2.1) was known to Poisson by 1827. With x := 0 he had
obtained the formula in 1823. Equation (2.2.5) was first obtained by Jacobi
using elliptic function theory in 1828.
2.2
Poisson Summation
c)
2.
n-
cot (
~ 2=
)
1 "
2z
-+2
z ,,=I z - n
Show that
whenever a ,
while x := gives
(2.3.2)
d e 2 ( e - s " ) = e4(e-*'">
d e 4 ( e - " ) = e2(e-*Is)
k(e-*s) = kt(e-*Is) .
Thus
on using Theorem 2.1 and (2.3.1) again. This produces the following
fundamental theorem.
Theorem 2.3
bho
b:
while
2 dk
a =- O
k h , '
42
log q,
(2.3.12)
(2.3.13)
(w.r.t.s)
e3
s(s)
'33
+ s-'
'33
+ s-l 4 (s-l) .
3' 2
on using (2.1.10).
We can now also express E in terms of theta functions. We have from
(1.3.13),
and
Similarly
and
b) show that 4~ = (x;=-, e-n2p)l@i=-, n e -n2p)c) Let r,(n) give the number of distinct representations of n as a sum
of k squares. Show that
where x, := k-', y := 6,
and yo is undefined. Moreover,
and
so that for n r 1
(2.4.6i)
T. In
the next
46
2.
.- 2114
In particular with x, := f i and yl .-
5. Show that
$)
~ ( 1 ,
= a-112~2(:)
2.5
and
&(I,
1
z)
=2 f i a -
512
F (4)
The systematic use of the derived AGM leads directly to quadratic algorithms for IT.
Algorithm 2.1
Let x, := fi, .rr, := 2 + f i , and y, := 2lI4. Define
+ 11Y, + 1
x, + 1
(iii) wn:= IT,,-*Yn + 1
Yn<
(ii) Y , + ~ : =
n21
nrl.
IT.
Moreover, for n r 0,
and, for n 2 2,
Proof. We first establish the limit. This follows from (2.3.7) (see
Exercise 1) or from (2.4.7) as we now indicate. Let nn:= ~ f i b ~ + ~ a ~ + ~ / a ~ + ~
where a, := 1 and bo := 1I*.
Then by (2.4.7) of the previous section,
%-, n. From (2.4.4) we have
provided that
Next
Hence
and
48
Here we have used (2.5.4) and a geometric estimate. From the above
,T.The 24th will produce more than 45 million digits at the expense of only a
few hundred arithmetic operations. A more exact asymptotic will be derived
in Chapter 5. (See Exercise 5.) Note also that the number of leading zeros
of y,+, gives the number of digits of agreement between v, and ,T to within
1.
The second algorithm, based on an identity of Gauss [1866], was discovered by Brent [76a] and Salamin [76] independently.
Algorithm 2.2
1-
24+1k 2
2 c,'
xi=,,
v m
where c, :=
= c ~ - , / 4 a nand a, and b, are computed by the AGM
iteration. Then v,, increases monotonically to ,T and satisfies
and
~r
IT.
The first eight iterations produce 0, 3, 8, 19, 41, 84, 171 and 344 digits,
which agrees extraordinarily well with the asymptotic.
Both of these algorithms generalize in many ways. (See Chapter 5.) At
the moment we only exhibit two additional identities.
If we use the differential equation for K equation (1.3.13), we may
rewrite Legendre's identity as
K1(k) ~ ( 1k')
,
K'(kf ) ~ ( 1k),
~ ( k )~ ~ (kl)
1 , kk'2 ~ ( k '~) ~ (k)1 ,
- = k'k2 IT
ll
K(k)
K (k)
1 + ~ n
1+x,
( 1 - k lK(k)
) a ,=, (-)+(I-k)-
I+y;
rI (-)1 + x , *
v=
50
cumulative error given by (2.5.3) and (2.5.7). We will say, informally, that
any such algorithm allows for quadratic or fast computation. In similar
fashion, we talk about mth-order computation if the cumulative error (i.e.,
digits correct) is of order mn after n steps.
Comments and Exercises
Algorithm 2.1 is derived in Borwein and Borwein [84a] by the method of
Exercise 1.
1. a) Differentiate (2.3.7) and apply (2.3.8) to establish that with
a, := 1 and b, := k t ,
,,.
3-,,.- lim
"+-yn+,-l
2'
3. a)
112"-1
lr
- 1)
and
11. There are actually eight natural products implicit in (2.4.7). One can
52
x
m
K'(k) =log(F)
16k' + 3
v2-"log(?)
K(k)
n=l
a,:=l
b,:= k'.
Theta functions are more properly considered as a function of two variables-a parameter q and an analytic variable z. So far we have considered
only special theta functions (z = 0). In this section we sketch some relevant
parts of the general theory. We write
&(z, q) := Ol(z, t) := 2qU4sin z - zq9l4sin 32
+ 2qZsf4sin 52 - . .
2.6
53
(2.6.3)
+ w)
8, ( z ) = -qe2iz81( z + w t )
8,(z) = qeZiz8,(z+ w t )
These identities show us that any theta function is entirely determined by its
values on any fundamental parallelogram
P(zo) := { z l z = z ,
+ r l w + r,nt, 0 Ir,, r, I 1) .
We assume as we may that the given function has no zeros on the boundary
of P(z,). It is obvious that z = 0 is a zero of & ( z ) so that w / 2 , w / 2 + w t / 2 ,
and w t / 2 are zeros of 8&), 8,(z) and 8,(z), respectively. Moreover, (2.6.4)
shows that z , + mw + nwt (m, n integral) is a zero of a theta function
whenever z , is. We now show that each theta function has exactly one zero
in each fundamental parallelogram. Thus we will have specified all the zeros
above.
Consider the integral
(2.6.6)
Proof.
ez(z)ez = e;(z)e;
ef (z)ef .
Consider
Here one takes the principal square root. The proof is left as Exercise 3.
It is equally simple to use the Jacobi triple-product of Chapter 3 to
produce product expressions for q(z) (Exercise 4). Various other relationships are indicated in the exercises.
Comments and Exercises
4.
i) 03(z) = Q,
n=l
n=l
Establish similar identities for 02(z) and 04(z). Many variations are listed
in ErdClyi et al. [53]. (This relies on Section 3.1.)
5. Use Exercise 4 to show that
(2)
i) -2_=4C
8&)
1- q2"
[-I
sin2(nz)
ii) log 63( 4 = 4 " (-l)"qn
2n
63(0)
.=I 1- q
n
6.
56
IJ-T,2
?rl2
max
-m/2sw~m/2
If(w - &)I
2.7
57
(This relies on Exercises 2a) and 2b) of the previous section.) Then
replacing z by ~'3;' and p by y = p'3,/82, we observe that, since k2 = 8;/8:,
This is solved by
58
which solves the inversion problem (at least for real k) for the given
integral. We wish to have cn2 sn2 = 1 and k2 sn2 + dn2 = 1, and it is
appropriate to define
and 0 I r h is given by
sin
:=
VGT-aj
then
(2.7.8)
(1 + k')
Io*
1- k2 sin2 o
dB
I/IT~Z%
u, = (1 + kt)u
1-k'
k, = 1+kt
sn cn
sn(u,, k,) = (1 + k r ) dn
since kkyl" = 1+ kr. The change of variables sin 8 = sn(u, k) and sin 8, =
sn(u,, k,) now produces (2.7.8). 0
Comments and Exercises
A profusion of information on the numerical use and derivation of various
Landen transforms is given in King [24]. More bibliographic information is
available in Watson [33] and in Whittaker and Watson [27].
1. Prove the differential equation (2.7.2) by showing that
is doubly periodic with periods r and rt12 and that +(z) has simple
poles possibly only at r12, r 1 2 + rt12, and translated points. Then
show that +(z + rt12) = +(z), and hence relative to the periods r and
r t / 2 , 4 is doubly periodic with a single pole. Thus 4 is constant.
Establish (2.7.5).
With cn and dn defined as in (2.7.6) and (2.7.7) show that
cn2 + sn2 = 1, dn2 + k2 9n2 = 1, and that cn(0) = dn(0) = 1.
Show that (dldu) sn(u, k) = cn(u, k) dn(u, k).
Show that sn is strictly increasing on (0, k) with sn(0, k) = 0 and
sn(K, k) = 1. [You may find it convenient to use the product
formulae for O,(z) and 8,(z).]
Show that dn(K) = kr.
Establish the double periodicity of sn(u, k), cn(u, k), and dn(u, k)
with respect to u and verify the following table:
sn
cn
dn
Periods
Zeros
4K,2iK1
4K, 2(K iK')
2K,4iK1
2mK + 2niK'
(2m 1)K + 2niK'
(2m l ) K + (2n 1)iK'
Poles
+
+
2mK
2mK
2mK
Residues
+ (2n + 1)iK'
+ (2n + l ) i K 1
+ (2n + 1)iK'
(-1)"lk
(-l)"+'ilk
(-l)"+li
and
then
('
These transforms and their analogues for incomplete second and third
integrals clearly lead to quadratic iterations which are studied in detail
in King [24] without reference to theta functions. Note that if $ = -12,
we recover the quadratic transformation for K.
a) Show that, in the notation of Theorem 2.6,
cos $, =
v m $
1- k'
sin (2$ - $, ) = -sin
1+ k'
= k,
sin
n=l
1- 2qZncos ( 2 4
- 2q2n-1 cos ( 2 4
+ q4"
+ q4n-2
61
Obtain similar expressions for cn(u + v) and dn(u + u). (See Whittaker
and Watson [27] and Exercise 11 of Section 1.7.)
Chapter Three
Proof.
63
t:
co(q)~4(q2
=)
(1 - q4n-2).
n=l
ut on replacing q by q2,
Hence
11 (1 - q 2 m ) .
m=l
Cl
64
Also
(3.1.6)
e3(q) = Q,Q; =
n=l
(1 - qZn)(1+ q2n-1)2
4, this gives
Finally, k := $ and 1:= 3 ' 4 give two formulae which play a central role in
the Rogers-Ramanujan identities (Section 3.4):
n
m
(3.1.12a)
(1 - q5"+')(l - q5n+4)(1
- qSn+')=
n =O
n=-m
(-l)nq(5n+3)n12
We finish the section with a sli htly less immediate corollary of the triple
product. If q := q1I2 and x := ql/'w, then (3.1.1) becomes
66
6.
IT:=,
(1 - q6") = 1.
where q := emit.
From the discussion of Section 2.6 show that both sides of (3.1.17)
are analytic with zeros at z = (n + $.)?rt + m?r (n, m integral). By
Liouville's theorem they differ only by a multiplicative constant.
Use Exercise 5 to show that this constant is 1.
Justify taking the limit in (3.1.14).
Prove that (3.1.15) is equivalent to
Observe that (3.1.18) has the following number theoretic interpretation due to Catalan (Dickson [71, vol. 21). The excess in the
number of even values of x + y + z in
67
b)
For the second equality see also Proposition 3.4 of Section 3.3.
Use
68
when q := e - n K ' ( k ) l K ( k )
We next derive another beautiful identity due to Jacobi, in which 8:
denotes the derivative of 8, with respect to z at zero. [See equation (2.6.1).]
Proof
and
m
and
We leave these identities as Exercise 6 and note that (3.2.4) and (3.2.8)
have number theoretic interpretations like that of (3.1.15). These three
identities and their remanipulations are among the very few known reductions of three theta terms to one known theta expression. (See Glasser and
Zucker [80].)
Following Weber 1081 and others, it is usual to identify the following
quantities. With q and r as above and r := i<r,
7 = q(v=7) = ~
( 7:=
) q1112~o
=A(*)
= f2(*)
= f2(r) := 2
=f(*)
= f(r) := q
112 1112
q Ql = (4k2/k')1112
Q2 = (4/kk')1'12.
-1124
K = -IT$ f 4 = - *
2
2kr
2 4
'
f*=-7)
2k
2 4
f 2 .
5'
(-l)m+n -4IT
=
log [f(-)l=
n,m=-mm + rn
IT
Here and hereafter, the prime over a summation indicates that the term
m = n = 0 is omitted and summation is over expanding rectangles. One
establishes (3.2.12) by writing
= IT
(-l)*
" "
2" cosech ( M k ) = 2 2
V7k
k=l
k = l m=-w
(_,)*+,
rk
since C:=, (-l)"+' lm2 = 7r2/12. This is the desired result. One may observe that whenever one can evaluate the double sum, one can also evaluate
kk'.
In some future work we will be following Ramanujan rather than Weber.
Ramanujan studied
Gn:= (2kk')-1112
= 2-l14f(-)
For some purposes these give slightly cleaner results. For example,
(233
kk' - = 4-12;
so that
We also have
and
Proposition 3.2
For Iql< 1,
I;:(
d
e ; ( q ) = -4q - - 3 = 8 q - log dq
),
+1
For 141< 1,
(3.2.23)
and
Of course, implicit in (3.2.23) is a formula for the number of representations of n as a sum of four squares. We discuss this more fully later.
72
b) Show that
73
5. Show that
g4n = 2 1 ' 4 g n ~ n
Gn = GI,, and g,l = g4,n
4 ( ~ , G , ) ~ ( G-I ~gjf) =
6. a) Prove Lemma 3.1 by expanding both sides.
b) Prove (3.2.7) and (3.2.8).
c) Establish Theorem 3.2.
d) Prove. (3.2.16) and (3.2.17). Try to prove (3.2.18). Exercise 9 will
help.
The next two exercises sketch out some facts about Jacobi's imaginary
quadratic transformations.
a)
b)
a.
8. a)
b) Now use Exercise 7a) and (2.3.17) and (2.3.18) to show that when
k, := k-l, kE(kl) = E(k) - ( k ~ ) ~ ~ ( k ) .
Derive
similar formulae for E(k,) for the other three imaginary
c)
transformations.
l/Z.
d) Evaluate K'IK when k:= fl+
9.
c)
74
,,=--
1
k2
-E l o g ( m )
= - n-s
iT
(3.2.28iii)
1
-log Q 2 = 2
z
1
7
m
02
1
log Q,Q: = 1% --m= n ( e z ~ n+~
2q
= 21- C1 1 - tanh ( r n s )
(3.2.28~)
(3.2.28vi)
2 log Q , Q ~= 2 log 0,
m
2K
(3.2.28vii)
1
- log Q,Q:
2
=-
51 1% 04
C tanh-l
1
(3.2.28viii) log
(F)
Q1(qZ)
"
(-I)n
=C
Qz(q )
1 n(e2"n"
l)
1
(
l
)
"
[
l
- tanh ( r n s ) ]
=-C
2 1
n
rs
--
(3.2.28ix)
1
e-""s = - 4 log
1 - k'
These are taken from Zucker [84]. Many more identities follow by
differentiation or as below.
a) Show that
.=,
2n + 1
~ / 8
~ / 2 4
p:=l
p:=3
'
and by
This allows for any complex value of s , but we will consider only integral
values. Thus for n in N, ( q ) , = fly=, (1 - q') and (q)7f, = 0. For m < 0 or
( )
(c)
(i)q
is a polynomial in q.
77
F ~ ( Yq), = F ~ - I ( Yq)
, + yqnFn-l(y, q ) .
One can also argue, combinatorially, that the coefficient of ysq' on each side
gives the number of partitions of t into s distinct parts not exceeding n.
If we now let y := x ~ and- n ~
:= 2 N , we obtain, after some manipulation,
For k = 1, ,2,
...,
Obviously the result is true for k = 1. Alternatively one can argue that the
coefficient of x t q h n each side counts partitions of s into t parts not
exceeding k.
Comments and Exercises
78
2.
Verify (3.3.5).
3.
Proof. Note that the above sums are all finite. Set k := n - m and
x := q2m in Proposition 3.4. Then multiply each side by ( q ) i A . We get
80
If we let N tend to
m,
we arrive at
3.5
81
Show that (3.4.1) says that the number of partitions of n into parts with
minimal difference 2 is the number of partitions into parts congruent to
1 or 4 modulo 5. Equally, (3.4.2) says that the number of partitions of n
into parts with minimal difference 2 and minimal part 2 is the number of
partitions into parts congruent to 2 or 3 modulo 5.
Verify the equivalence of (3.4.1) to (3.4.3) and (3.4.2) to (3.4.4).
Derive the second identity (3.4.4) from Theorem 3.3. There is an
interesting continued fraction associated with the identities.
Our first application is a proof of Jacobi's formula for r4(n), the number of
representations of n as a sum of four squares [including sign and permutation so r4(2)= 24 and r 4 ( l )= 81
Theorem 3.4
For each positive integer n ,
to
0:(q)=1+8):
3r n q m k = 1 + 8 2( C d ) q n
m=O k = l
n=l
dln
The subsidiary conclusion is Lagrange's famous result. Our second application is an analytic proof of Fermat's theorem that any prime of the form
4k + 1 is the sum of two squares. It is convenient to define
and
(3.5.2)
where odd(n) is the odd part of n (that is, the largest odd divisor of n). We
set w(n) := 0 for n 5 0. The proof of the following lemma is left as
Exercise 3.
Lemma 3.3
(3.5.3) w(n) + 2
(-lliw(n
83
2
n =r
otherwise
- j2) = (2(i1)
jzl
'
Thus some wxp - j2) is not divisible by 4 and Lemma 3.3 implies that
p = j2 + k2 for some integers j and k.
Our third application is to establish the following reciprocity result for
Gaussian sums. Let
Theorem 3.6
For positive integers p and q with pq even,
S(P, q ) =
63
q l-i-
S(q, P)
Lemma 3.4
For pq even and q positive,
Proof. Write
@,(E
+ iplq) := C;=-,
. ~h en
e-mn2(~+ip/q)
Since S( p , q ) = Z:=,
~im
&'+OC * e 3 ( E '
- 3 + 0(&'2)) .
+ 4 ) is divisible by 5 .
+ 5 ) is divisible by 7 .
where k := 1 + ( 3 n + l ) n / 2 + m(m + 1 ) / 2 . This uses the triple-product identities (3.1.10) and (3.1.14) multiplied together. One now considers when k is
85
divisible by 5. Since 2(n + 1)2+ (2m + 1)' = 8k(mod 5), we must have
2(n + 1)' + (2m 1)' divisible by 5. An inspection of residues shows that
this can only happen if 2(n + 1)2 and (2m + 1)' are both divisible by 5.
Hence, 2m + 1is divisible by 5 and so is the coefficient of q5m+5 in qQ4(q).
From the binomial theorem one deduces that
in the sense that all coefficients are congruent. It then follows that Q(q5)/
5m+5
in
Q5(q)= l(mod 5), and hence the coefficient of q
is divisible by 5. But
Case (b) is similar, but uses the square of (3.1.14) instead of the product
with Euler's series. (See Exercise 5.) 17
Comments and Exercises
The identity (3.5.1) was discovered by Jacobi on April 24th, 1828. He also
subsequently observed similar number theoretic interpretations of the formulae for 0; and 0:, and he gave an arithmetic proof of his theorem. The
identities can also be found in Gauss's unpublished work. A wealth of this
and similar information can be found in Dickson 171, vol. 21.
An analysis of the components of our proof of (3.5.1) shows that it is
rather simpler than that in Hardy and Wright [60], which proceeds from
r,(n), or in Rademacher [73], which uses elliptic function arguments. We
use only the triple-product identify and the AGM.
1. Show that r4(n) is 8 times the sum of the odd divisors when n is odd,
and r4(n) is 24 times the sum of the odd divisors when n is even.
The argument of Theorem 3.5 is due to Ewe11 [83]. We give the general
formula for r,(n) in Chapter 9.
3. Prove Lemma 3.3. Consider odd and even cases and use the multiplicativity of a,.
Gaussian sums arise naturally in the study of cyclotomic polynomials and
hence occurred to Gauss while studying constructible polygons. Apparently
this led Gauss to the lemniscate and thence to elliptic functions. The
formula (3.5.4) (with p :=2) plays a key role in establishing the class
number formula for binary forms. Landau [58] is sufficiently taken by the
result that he presents three proofs. The result we give is due to Dirichlet.
The case with p := 2 is due to Gauss save for the "detail" of determining the
sign of the complex square root. (See Landau for the significance of the
sign.) Our proof follows Bellman [61].
4.
6.
2m )zk
a) ( z
k=-m
3.6
fi
87
c)
-I12skd12
+ e -(s+2wrk)l21
2
e)
3.6
n1I2
7
(-)n +2nr ,
e -x2
lim 'n
= x.
V7-i
Deduce from c) and d) the following form of the general theta
transformation:
ii)
lim
n--
=-
if
n+m
We continue our tour through theta function theory with a discussion of the
Riemann zeta function. This also allows us to catalogue a few useful
properties of the Mellin transform for future use.
The Mellin transform is a specialized Laplace transform defined by
lom
f(xy)xs-' dx = y-"
/om
f(x)xs-' dx
y >0 .
88
Under mild conditions, the Mellin transform is invertible and one can
identify two functions whose transforms agree for re(s) > 0. (Certainly this is
true if the functions are transformable and continuous.) This is an appropriate place to state the following functional characterization of the gamma
function, whose proof is left as a guided exercise.
Theorem 3.8
The gamma function is the unique function f : (0, a)-t [O, w) such that
( 1 ) f(1) = 1
( 2 ) f(x + 1) = xf(x) for x > 0, and
(3) log f(x) is convex.
Many otherwise tedious facts are easy consequence of this functional
characterization of the gamma function. We list three whose proofs are left
as exercises. These are the functional relation r ( s ) r ( l - s) = rlsin r s
(1.6.6), the beta function formula p(s, t) = r(s)r(t) / r ( s + t) (1.6.7), and
the duplication formula
We now derive the functional equation for the Riemann zeta function
c(s) := Cz=l n-" re($ > 1. We observe in passing that 5 has an immediate
analytic continuation to re@) > 0 simply by writing
so that
(3.6.4)
l(s) =
-g( - y + l n - s
n=l
Thus
re(s) > 0 .
- 11/2.
3.6
Here we have used the theta transform for 8, (2.3.1) to substitute for g on
[I, w). This leads to
Thus
0 5 g(x) - log
n!nX
5 x l o g (I
X(X+ 1)-- .(x + n)
+ );
and
r(x) = lim
n+-
n!nx
x(x + 1) - - . (x + n)
=fW .
a) Prove formulae (1.6.6), (1.6.7), and (3.6.3). In the latter case write
the ostensible identity in the form r(x) = f(x) and verify that f
satisfies Theorem 3.8.
b) Establish that I? has an analytic continuation to the entire plane
with simple poles at the negative integers and zero, and with no
zeros.
fi
90
3. Let
-(t+x2/41) f 1 1 2
dt.
a) Use (3.6.4) and the fact that [(s) has a pole with residue 1 at 1to
show that
l ( ~ =)
(1 -p-s)-l
re(s) > 1
p prime
6.
(Stirling's formula)
The formula is
Now replace u by t
m and obtain
3.7
91
Since the theta functions provide quadratic analogues for the geometric
series, it is natural to ask about their relationship to sums of the form
m
(3.7.1)
C a,'
where
a,,, := Ma,
+ Nan-,
n=l
(Exercise 1) and
a+p=M
ap=-N.
We will consider only the case in which N = + 1 and M is real. We may write
a,,, = (2c)a, + &an-,, Is t = 1, and to assure that a , p are real, we assume
c>max (0, -6). Then ap = + 1 and
The following proposition provides the key. We define the Lambert series
Proposition 3.5
For
and
92
Similarly,
and
pn and the second now follows from (3.2.27), the formula for
Proposition 3.5 will allow us to sum C:=, a,' for certain starting values.
Specifically, we can handIe E = +-Iand A = + B in (3.7.3) and (3.7.4).
CASE
(i)
(E=
-1)
as follows from (3.7.2) and Proposition 3.5 (iv). Similarly, one can
evaluate
and
m
C a;,:,
n=O
=2
V T i [ L ( p ) - 2 L ( p 2 )+ L ( p 4 ) ].
(ii) (E = 1) Then
ap = -1.
Now
C a:;
E2GT-i
93
P < 0 and
1
= 2 V 2 z [ L ( p 2 ) - L(p4)]
a2" - p2*
n=O
,,=I
c. Q;,:,=2GTiC
-- rn
2
aZn+l lplZnf1
ce:(l~I) - G(P"I
-.
CASE
(i) (E = -1)
Then
(ii) (E = 1) Then
and
In certain cases the theta series involved above are particularly simple to
evaluate. For example, in Case 2(i), we have p = c - -.
Thus c =
( p p -l) 12. If p := lo-", the series 8,(P) and 8,(p2) in (3.7.8) can be
evaluated entirely by writing down sequences of 1's and 0's. Thus for
2c = 10" + lo-", the sums in (3.7.8) can be evaluated at the same speed as
one can multiply n-digit numbers. (See Chapter 6.) Moreover, since theta
functions can be fast computed for any algebraic P, the series are always fast
computable. (See Chapter 7 and Exercise 6.) Hence the series of Exercises
3, 4, and 5 are quadratically computable.
Finally, consider (3.7.7) for c := sinh ( m ) and with s > 0. Then
an+,:=2sinh(ns)an+a,-,, ao:=O, a l : = l , and
m
(3.7.10)
S(s) := C a Z 1 =
n=O
cosh ( m )
[O,(e-?" - ~ ~ ( e - " ~.) ]
2
where
on using
For singular values of k (see Section 4.6), this formula becomes particularly
pretty. [See Exercise 6b).]
Comments and Exercises
1. a) Show that when x2 = Mx + N has distinct roots a and p, an+,=
Ma, + Nan-, is solved by (3.7.2).
b) If a = p, show that
and
5.
L e t a o : = 0 , a l : = 1 , a n d a n + l : = 9 . 9 a n + a n ~Showthat
l.
6.
96
Og(e-") - O:(e-"')
sech [(2n + l ) m ] =
'7
and
O;(e-"")
co
sech ( n m ) =
+1
'7
d) Show that
x
a
(2 - f i ) r 2 ( $ )
n=O
7.
1 6 ~ 7
Equations (3.2.28) and (3.2.29) can be used to derive closed forms for
a host of other reciprocal sums. (Fn and Ln are as in Exercise 3.)
a) Use (3.2.29) to show that
where p := (3 - ~) 12.
b) Use (3.2.28iv) to show that
1
5--.,, 2nFzn
v3 log
12
where p := (fi
- 1) 12.
Use (3,2.28i) and transformation formulae to show that
1 2
l+k'2K2-KE
cosech2(nas) = - + 7
n=l
6 T
x
m
i)
[(?)
ii)
n=l
m
iii)
n=O
y)
4 [(q)
[(
1+ 7
cosech2 (2nm) = 6 T
2 coseeh2[(2n + l)?rs] =
i)
2EK 1
2 sech2( n m ) = -m2
2
n=l
K2
K2 - KE].
ii)
n=l
m
iii)
77
n=O
Show that
ii)
C sech2
n =O
27T
'
and
and show that a similar formula holds for all recursions of the form
an+,=(2c)an+ an-,, ao:=2c, and a;:=l.
Show that
Show that
-2
2 (-1)"m
cosech (2mm) =
m=l
2 sech2 [(2m + l ) m ]
m =O
9. Show that
and
10. a)
b) Hence show that all the series considered in this section can be
computed with at most O ( f i ) operations for n digits. (See
Chapter 6.)
A related formula is
;i
ai" ,
Show that
where q := (fi
- 1)/2.
K
-=-=
2-2
47r
~~/lo~[(V7-1)/2]
I
63[e
4 log [(fi
+ 1) 121
where the last equality follows from the theta transform. One sees that
K E / ~ T 3~ is~ close
+
to
xn
where
i)
%--1-x"x n
C,
r(n)xn
n=l
if n is square
if n is nonsquare.
1=
where
rn
n = n[ pfi
i=l
c)
Show that
where p := (3 - f i ) / 2 .
Zucker [79] gives general formulae for sums of powers of hyperbolic
functions (in which the coefficients are defined recursively and have been
computed extensively by Ramanujan and Zucker). Using these one can
~
of 8 when
evaluate C:=, (- I ) " + ' F , ~ ~ and C;=, F , ~ ~in+ terms
k := 172,3,. . . . There are similar Lucas number results, and if K and E are
used, many more sums are expressible. We give two examples:
and
Here, as before, P := (3 - v 3 ) / 2 .
13. L e t u , : = O , u , : = 1 , a n d u n + l : = a u n + u n ~ l . L e t v o : = 2 , v l : = a , a n d
v , + ~:= av, + vn-I .
a) Establish that
b) In particular,
c)
Compare
and
ii)
x"
n=1
d) ~ h o kthat
k2
Chapter Four
102
103
M(6 k ) dy
dx
- k2x2)
v
(
1
x2)(1
-y2)(l - l Z 2 )
1-y
l+y
-=
( P - X Q ) ~-.
I-x
( P + x Q ) ~1 + x
(4.1.7b)
l - l y = ( l - k x ) ~ ~ / ~1 + l y = ( 1 + k x ) ~ 2 1 ~
104
1 - = riv- vu
Q(0)
-=1+2M(1, k)
ABCD
P(0)
degP=2p
degQ=2(p-1).
For n = 4 p + 3 ,
(4.1.11)
deg P = 2p
deg Q = 2p
then
(4.1.13) becomes
.1.15b)
k f 2=
(1 - a ) ( l +
2a+1
and
lf2=
( 1 + a ) ( l - a)3
( 2 f f+ I ) ~ *
(4.117)
In the associated variables u := k1I4 and v := ill4 this has a simpler form,
u4 - v 4 + 2uv(l - u2v2)= 0 .
See Section 4.5.) The muItiplier M has any of the following forms:
- .
zv3-
( n = 5). We have P = 1 + px2 and Q = a. Equation (4.1.13) leads to the three equations (h2:= k 5 / l )
QUINTIC TRANSFORMATION
'
106
(4.1.21)
u6 - u6
u4u4)= 0 .
The transformations are called cubic, quintic, and so on, because of the
underlying order of the transformation (and so of convergence).
Comments and Exercises
The very classical approach of the section to cubic and quintic modular
equations is due to Jacobi [1829]. This algebraic approach was extended to
the septic (n = 7) case by Cayley [1874], who also treated the endecadic
(n = 11) case partially. The calculations are formidable. We have followed
Cayley closely in this discussion. The associated variables u := k1I4 and
v := l1I4 of JacobiYsconsiderably simplify the calculations, as we will see in
Section 4.5.
In order to find the relationship between K(1) and K(k) implicit in (4.1.4)
we must show that the underlying transformation (4.1.5) is one to one and
onto on the interval [0,1]. This can be done directly for n = 3 (Exercise 3)
and other small n. However, as with most of the details, it is easier to use
the general transcendental approach of Sections 4.3, 4.4, and 4.5.
1. a) Show that an equation of the form (4.1.5) that is invariant under
the change of variables (x, y)+ (1 lkx, llly) exists. Hint: Set
Y(V, U) = ( v 2 - u 2 ) ( v 2- Z ~ T J ~ )
and
and
D-R
2
and
108
and
Hint:
K'(u4)
K'(v4)
-- 5 -.
(4.1.26)
K(u~)
K(v4)
so
does ( u , v ) = ( z " / ~ , k' 1 4 ) . NOWExercise 1 of Section 1.5 shows that
the two sides of the above equation differ by a constant. Use the
logarithmic asymptotic at 0 and the relationship between u and v as
v +0 to evaluate this constant.
(4.1.27)
k(q):=k=%l)
kyq):= k'= G(q)
q = e-nK'(k)lK(k)
From Exercise l e ) of Section 1.4 and Exercise 5 of Section 4.1 we see that
the quadratic modular equation (4.1.3) is satisfied by 1 := k(q1'2) and
k := k( q), while the cubic equation is solved by 1 := k ( q 1 1 3and
) k := k ( q )
and the quintic equation is solved by 1 := k ( q l 1 5 ) and k := k( q). [To see this
just observe that (4.2.4) uniquely determines q.] In general we will see in
the next sections that the pth-order modular equation for k is a polynomial
in two variables with integer coefficients that is satisfied by k ( q P )and k ( q ) .
We observe from (4.2.4) that for these algebraically connected moduli
110
and
Now
(h+j+1)2+3(h-j)2=(2h-j+ $)'+3(j+4)'.
Thus subtraction of the two theta identities produces
m
63(qP3(q3)- 4(q)64(q3)= 2
IC
m,n=-m
(m+ f)'+3(n
+J ) 2
m+n even
111
that
O 5 k(qpn)5 4eWCp" 4 E
and that
3.
Hint: Write
112
(See (4.5.4).)
Use part a) with a := 7 and b := 1 to establish this formula.
Hint: Set x:= y : = + l to find a formula for o , ( ~ ) B , ( ~ ~ ) +
B ~ ( ~ ) B ~ ( ~Now
: ) . set x := y7 .- k Y 7 to similarly write
62(q)ez(q )f82(-q)ez(-q7)=ez(q)ez(q )- On making
rearrangements this yields
c)
as required.
Establish the cubic modular equation (4.2.7) as above.
113
at + b
ct + d
a, b, c, d integers,
ad - bc = 1.
w=- at + b
ct + d
U {It1 = 1 and -
4 5 re(t) 5 0) .
u 1121+ 11= 1) .
These two sets described in a) and b) are fundamental sets for the T- and
A-groups. The interiors of these two sets (F! and F:) are fundamental
regions in the following sense.
114
Theorem 4.2
of some point
The function
is a A-modular function.
A(-)
= h(t)
h(im)=O
h(O)=l
h(+l)=m.
The first value is immediate from (4.3.1), while the value at zero can be
alculated from (4.3.4). (Observe that as t-0 in FA, llt-+a in FA.)The
alue at 1 is computed from Jacobi's imaginary transformation
- (See Exercisc
'
The function
116
This is only the very tip of the iceberg. We have restricted our attention to
two particular groups where we can directly establish the existence of
modular functions. In general this restriction is unnecessary. Only slightly
further into the theory are results such as: any modular function takes each
complex value the same number of times in the fundamentaI region. An
important consequence of this is that h takes every value exactly once in FA
and h has a well-defined inverse that has branch points only at 0, 1, and
(Exercise 10). J has similar properties on F,. One can now prove, much as
for elliptic functions, that two nonconstant functions which are modular with
respect to the same group are algebraically connected. Furthermore, if one
of these functions is univalent on the fundamental region, then the other is a
rational function of it.
This wide-ranging and difficult body of theory that is intimately tied in to
many questions in number theory and algebraic geometry may be pursued in
any number of texts, such as Apostol [76b], Chandrasekharan [85], Lang
[73], Lehner [66], Rankin [77], or Schoeneberg [76].
Two of the seminal papers of the subject, both dating from 1882, are due
to Klein and PoincarC (selections of which may be found in Birkhoff [73]).
Poincare was interested in studying linear differential equations with algebraic coefficients. Klein, who considered this his main field of work, in
keeping with his Erlanger Programme had more algebraic and geometric
interests (Klein and Fricke [1892]).
117
and
T~:=(!
-:).
and
1 0
TA:=(2
(: f;) E T.Show
where the b, are integers. Show that llA(t) is finite at every point of
F, - {ia).
a) Prove Theorem 4.4 from (4.3.4) and (4.3.6).
b) Let q := e2"': Show that
-24
) .]
as t+ i a .
Hint: Let f(q) := f(t) where q : = e2"" for t in F,. Show, by
modularity off, that f is meromorphic in q in a neighbourhood of
zero. Show also that f has a pole at zero. Thus f has a convergent
+expansion at i a with finite principal part.
b) Show that the r-modular functions form a field. Likewise the
A-modular functions.
Establish that f y , f y , and
modular.
f24
4.4
i)
a, b, c, d integers,
ad - bc = p
120
+ 1 functions
AiS=Ajmodr
Thus by the modularity of J ,
forsomej
121
which with part (b) of the lemma finishes the proof. The second part is
identical via parts (c) and (d) of the lemma. 0
The modular equation for A of order p is the polynomial
P
wp(x,~):=n(x-Ai)
ni:=no~i.
i=O
This is obviously of degree p + 1 in x and has a root at each A,. Note that
Ap(t) := A( pt) and Ai(t) = A((t + 2i) lp), i <p . Thus as functions of q,
iqliP), where a = 1. We now show that
Ap(q) = A( qP) and hi( q) = A(CY
(independent of t) Wp is also a polynomial in A. This relies on two basic
facts. First, any symmetric polynomial in the Ai is A-modular and second,
any A-modular function is a rational function of A.
Theorem 4.7
Wp(x, A) is a polynomial of degree p + 1in x and A with integer coefficients.
The coefficients of x p f l and AP'l are both 1.
Proof. Consider q ( x , A) := II;=, ( y - xi) , where Ai := 16/Ai and
y := 16lx. This is a convenient form to work with. Observe that Wp and WP
are connected by
n
P
A) .
16p*1~p(x,
A)= [ x ~ + l
i-0
xi
A(~)=
n=l
q,n-l
)8
"
A:= - = - +
anqn
A
q n-o
ii=
1
aq
A
+ C ananiqnip
n=O
= qP
-l+ Cmand"'
n=O
i<p
122
where a is a primitive pth root of unity. It can now be established that there
are integers ci so that
and si- P( h) has zero principal part. This is easily proved. First remove the
term by considering
q-(p+l)
and then proceed inductively. Observe from (4.4.4) and (4.3.5) that the
only candidate for a pole of si- ~ ( h is) q = 0 but P has been chosen so that
si- ~ ( h )is finite at q = 0. Thus we see that si- ~ ( h is
) a bounded
A-modular function and is hence, by Theorem 4.5, constant. Since
A) when
Theorem 4.8
The modular equation for j of order p
P
(4.4.6)
FP(x,j):=(-j)
ji:=j0Ai
i=o
+ 1 in x
and j. The
Suppose that
l
m
f(q) :=
cnqn
cn real.
n=-h
[f(anqnlp)]m
m integer
n=l
and
c)
for some a. Equivalently MST-'M E r for all M E Tp. One can show
4.5
- v FORM
The actual calculation of the modular relation for h is most readily carried
out in the associated variables u := k1I4:= A 'I8 and v := 1'I4 := x 118. Analogously to Theorem 4.7, though with a few substantial additional details (see
Exercises 1 and 2), we have the following theorem.
Theorem 4.9
If p is an odd prime, then the modular equation in u-v form is given by
where
a, (v, 1) = (v f
- 1)
flp(v, U)= -.np(-u, v) = Op(-v, -u) .
The coefficient of up+' is -1 and the coefficient of up+' is 1.
The most striking property of the modular equation in u and v is the
"octicity." Because ui(qP) is of the form aiq118f(a8iq),where f is analytic in
q, only every eighth coefficient of the modular equation is nonzero. (See
Exercise 3.) We illustrate with some examples which give the nonzero
coefficients and the column sums.
u3
uZ
4.5
132
- v Form
133
134
Exercises 1 and 2 outline the proof of Theorem 4.9. Some of the details
are rather complicated. The flavour should come through.
1. a)
and
uoTA=u
s?)
2.
3.
Prove the "octicity" of the u-v modular equation. That is, for p = + 1
mod 8 the nonzero entries of the table associated with a, lie only on the
main diagonal and every eighth sub and super diagonal, while for
p = 2 3 mod 8 the nonzero entries are only in every second entry of
every fourth diagonal.
4.
5. Show that
a, can be written as
- v Form
a)
(p2-1)18
(p-1)12
+ 0(u2) while
uo = fi/Zq118p
+
and
C1,l
- -(-1)(~~-1)18lim up .
u-0
Uo
c)
> 0 ,has
Hence if wP(y:, y:+,)=0 with 1>y , > ~ ~ + ~ one
136
8.
(0,1,2,3,4,5)-,(1,2,3,4,0,5)
TA: (0,1, 27 37 4, 5)+(0, 5, 3 , 1 , 2 , 4 ) .
b) For p = 7
s,:
m i , 2,3,4,5,6,7)-, ( 1 , 2 , 3 , 4 , 5 , 6 , 0 , 7 )
T,: ( 0 , 1 , 2 , 3 , 4 , 5 , 6 , 7 ) - , ( 0 , 3 , 1 , 4 , 6 , 7 , 5 , 2 ) .
Show, using a), that the Galois group of W, contains A , and is not
solvable.
d) Show, using b), that the Galois group for W7 is not solvable.
In both cases these permutations actually generate the Galois
group. (See Exercise 10 of 4.4, and Exercise 1 of 4.5.)
c)
4.6
THE MULTIPLIER
4.6
The Multiplier
Mp(l, k ) :=
'fa4
K(k)
B ; ( ~ " ~- )
-
KO
=k(q)
11t2 ,dk
p"p=---k k t 2 dl
v ( 1 - u s ) du
~ ( 1 u- s ) dv
'
Proof.
Tr
Lt)=-Zm.
dl ( L
1
l+lt
M2(l, k ) = -= l+k
2
ich corresponds with Theorem 2.6.
138
+
= 1, by equation (4.2.6). This proWhen p := 3, we have
duces, on differentiating implicitly and using (4.6.2),
because
(4.6.7)
M7 =
v(1- uu)[l - uv
7
v -u
+ ( u v ) ~ =] -
U-U
7 u ( l - uu)[l
- uv
+ (uv)']
'
(See Cayley [1895].) Many other multipliers have been calculated and can
be found in Ramanujan's collected works, Cayley [1874], Tannery and Molk
[1893], in Weber [08] and elsewhere. The main technique for larger p is via
manipulation of theta series. Thus one has Ramanujan's form of the
multiplier for 13:
We list also
(4.6.9)
17Ml7(l, k) =
(f)
(k) + (g)
112
112
( )""
11'
((:)lI4
-2 7 l+ kk
112
)"'I
+ ((f
7
1
M17(k7 '1
'
139
=-
2k,11 = 1
.\/z
l,=vZ
' Fz
- 3 l I 4 ) ( v 3- 1 )
I =
fi(v3i1)
1,
m+m2kSl,
I,
2k-1-= -
= fi- 2
'
(v7+ 3 l I 4 ) ( v 3- 1)
2
1. From the results of Section 1.5 (in particular Theorem 1.5 and Exercise
1) we know that G ( k ):= k 1 I 2 k ' ~ ( ksatisfies
)
G ( k )=
q%
G(1)
140
where k and 1 are solutions of the pth-order modular equation W,. Use
this to show that
3,
5. Verify the singular values in (4.6.10). In each odd case one verifies
t, = 2k,kA first and uses
141
(VIO - 3)'(4
- dE)'(fl-
a ) ( 6-
m).
and
a*t + b*
c*t + d*
ad- b c = p , a*d*-b*c*=l.
t* =
Then j(t*) = j(t). Also t* solves a quadratic with the same discriminant as that for t, and t* possesses a multiplication by p if and
only if t does. Now (4.4.6) implies Fp(x, j) = 0 is solved by j(t),
because S = A, m o d r for some i and hence j(t) = ji(t). Thus
F,(j, j) = 0 is solved by j(t) exactly when t possesses complex
multiplication. Note that F,(j, j) will have lower degree than
Fp(x7j)Since j is one to one on the fundamental region, j(t) = j(t*) if and
only if the associated primitive binary quadratic forms are properly
equivalent. A binary form ax2 + bxy + cy2 = 0 is primitive if
gcd(a, b, c) = 1. Two forms are properly equivalent if there is an
integral linear transformation of determinant 1 converting the one
into the other. (See Dickson [71, vol. 31.)
Thus the study of the degree of F,(j, j) becomes the study of
h(- D): the class number of primitive forms of negative discriminant -D. More of this is sketched in Hardy [40, chap. 101, and in
Tannery and Molk [1893].
142
7. For fixed m and p let yo := k, be the mth singular value and let y, be
the sequence of solutions of w,(~:, y:+,) = 0 as defined in Exercise 6 of
Section 4.5. Use Theorem 2.3 and (3.2.3) to show that
When
This provides an algorithm of order p for computing e-&.
p := 2, this reduces to the algorithm in Exercise 3 of Section 2.5.
By the same process as in Section 4.5 one can produce polynomials in
x := Mil(l, k) and k. Cayley [1874], following Joubert, gives
and
With
one has
(c)
82( 4')
[3,0-1]3=-4(-;;7;T)
kr3/k6
114
,
m
( 1 + q2n+1)3
= flq118(kk1)-114
n=O
and
144
This establishes the first equality for part (a). The second is most easily
seen by using equation (4.1.15), with y := k and k := I, to write k3/y =
[(2 + a ) l ( 2 a + 1)14 k'3/r1= [ ( I - a ) / ( 2 a + l ) I 4 and using (4.1.19) or
(4.6.5) to write 0;(q3)10:(q)= 1 / ( 2 a + I ) ~ Then
.
both sides of the second
equality become 4(2 + a ) ( l - a ) / ( 2 a + I ) ~ .
These identities have many remarkable consequences some of which we
leave as exercises. Two, however, are worthy of more explicit analysis.
AN ITERATION FOR THE CUBIC MULTIPLIER. In decreasing form, as above, we
consider the multiplier M, := K,, ,IKn where K, := ~ ( q ~ "Then
) . part a)
of Theorem 4.11 shows that, with m , := 3M,, we have
where
/0:(q3"),so that
Alternatively, part ( b ) shows that if we set rn := 38:(q3n+1)
m , = (r, + 3 ) / ( r , - 1) (see Exercise 8 ) . We have
where
This latter is more suitable when we begin with an even singular value.
We have written (4.7.4) and (4.7.5) in a form consistent with Ramanujan7s
invariants Gn and g,. [See (3.2.13) and Exercise 5 , Section 3.2.1 In these
terms, the iterations are initialized by
and
q := <-&
[,
G,
AND
gn. We have
'
Thus
(4.7.9)
Similarly
9 = ( 1 + 2 f i GG) n( 1 + 2 f i $ )
Ggn
146
Correspondingly
From these two formulae, and other singular values given by Ramanujan,
one can give explicit equations for some very large invariants (G;,,, , G$,, ,
3
g,,,,,
for example). We illustrate with g,,,,. Ramanujan gives
(See also Exercise 13.) Analogous results for quintic and septic multipliers
are treated in Section 9.5.
Comments and Exercises
The quintuple-product identity was certainly known in essence to Ramanujan so that our derivation of Theorem 4.11 is in all likelihood similar to that
which he had in mind. The proof we give in Section 9.4 is self-contained and
can be read with ease now. Biagioli [Pr] has given a modular function proof
of (4.7.1).
1. a)
4.7
(a)
Show that
ii)
2
f i + l
g 1 , = 2
fi+1
iii) G,, = 2
148
and
9. a) Given that
verify that
3 - 4 7 . vi+d 312
GW6=(-)(
7/3
and
and
and
g~: = (f-i
149
g;$3=(V26+5)(fl-3)3
2
3)(fi
- 2),
g;0613
=(
and
f i - 3 ) ( f i + 2) and
l,lqEx5j.
m.
and r(4) = q 3 + fi +
Verify that m(1) =
Hence
Verify that r ( 2 ) = f i + f i and r n ( $ ) = f i - f i + l .
m(&)=3(36+7-4fi-5fi.
Verify that m($) = (3 - 6
+ 1) and m(&) = ( 6 G + 9) ( 4 6+6fi).
Show that (4.7.9) can be written as
X ~ - ~ ~ G , [ G > + ~ G :G+> + ~ ] x +
G~,[G>+I+~G:+ G ~ + I ]
=o.
b) Show that ,g:,
and
x2 - fig,[&
-&,,
+ 1y~
-
- g',[& - 1]+-/I
=o.
c)
G2,, = (2 + f i \ / j ) " 3 ( q ) [ v G m
+1 5 y
and
G25 19 =
(2 +1?)1/3=
m
i([f)-
151141.
150
and
a2
=(
(z)
(=)
m + x)"'
x~
ii)
exp
n2.rry
n=-a
n=-a
= (/,I
x +Y
cos
n=-m
x +Y
e-"'"'
cos (n2.rry)
e-n2mx
x +Y
,,==-a
sin (n2.rry)
--n 2.rrx
n2.rry
exp x + y sin (
x + Y n)
(-)
- x)"'
n=-m
- ( p ' T j 7+ x)lI2
e-"'"
cos (n2.rry)
e-n2?rx
sin (n2.rry).
n=-m
2
m
n=-m
e-n2?rs
cos (n27Ts1)=
t$(e-'")
e)
f)
and
gi;
=f
i ( f i+I ) ~ .
g)
and
~::=4(7+3fi).
to obtain
Chapter Five
5.1
In Section 4.6 singular values were introduced. We will call A* the singular
value function (of the first kind) where A*(r) := k(e-dr) as in Section 3.2.
We introduce the singular value function (of the second kind) a which we
define for positive r by
and
153
k := h*(r)
and
where
Theorem 5.1
The function a is montonically decreasing for r 2 1.
154
where q := e -4
. Expanding this gives
(5.1.11)
+ re-'^^)
r r l
and
where q := e -d
5.2
Calculation of a
CALCULATION OF a
1zr2 dMp(L k )
MP(W
dl
:[
(1)-PI.
E(k) - kr2K(k)
dK
dl
- (1) =
E(1) - lr2K(1)
11r2
dMp(17 k )
PM;(E,k ) [ E ( k )- k r 2 K ( k ) =
] Mp(l, k ) [ E ( l )- l r 2 ~ ( 1+) ]K(1)11r2
dl
On dividing each side by K(k)we have (5.2.1).
We now derive the key identity for a .
156
Theorem 5.2
For p and r > 0 let 1 := h*(r) and k := ~ * ( ~ Then
~ r ) .
=M
(5.2.3.)
pkk''
dM,(l, k ) ]
Mp(L k )
dk
This gives
where k := h * ( p ) .
Let us observe that, since h * ( p ) and Mp are algebraic for rational p, this
shows that a ( p ) is algebraic for rational p.
EXAMPLE 5.1. When p := 2, M,(l, k ) = 11(1+ k ) and k = A1(2) = f i - 1.
Then dM,(k1, k ) ldk = - and 4 2 ) = f i ( f i - 1)' + (\/Z - 1)2 = f i - 1.
(Compare Exercise 2 of Section 1.) Similarly 4 3 ) = (fi
- 1 ) / 2 and also
a(7)= (fi
- 2 ) 12. (See Exercise 3.)
5.2
Calculation of a
157
Before continuing to evaluate a it is necessary to connect it to Ramanujan's multiplier (of the second kind).
where
Proposition 5.2
Let p , k , and 1 be as in Theorem 5.2. Then
Now we use (4.6.2) for dlldk and (2.3.10) for qdkldq and obtain
and
(5.2.12)
~ ( p:=) R, ( k t , k )
k:=e-M.
158
TI
(a)
= h * ( p ) , we
have
EXAMPLE
I
114
-1
(i)
1 - I'
k= 1 + 1'
and
(5.2.16)
(ii) a ( 4 r ) = ( 1 + k)'a(r)-24Tk.
Calculation of a
5.2
1 + kl+ k'l'
3(1+ kl + k'l')
15
17
19
1+
6[(1+kl + k'l') + a
31
3{3(1+kl + k'l') + 4 ( a +
+
- 4 ( k k ' Z ~ ' ) ' ~ ~(kl)lJ4
[l
+ (k'Z')1'4])
35
2 [ V m + r n - W ]
+ (4kk'll')-''6[1-
m- W ]
+k
rn m)
-m
13
+ k ) ( l l+ k )
and
since I'
For example,
h*(4)=
(a- 1)'
and
a ( 4 ) = 2 ( f i - 1)'
+ k,
we
160
TI
Proposition 5.4
If 1 := A*(r) and k := h*(9r), then
JT-
(5.2.17)
(ii) a ( 9 r ) = s 2 ( r ) a ( r ) -
v7[s2(r)+ 2s(r) - 31
2
(iii) m(9r) =
Proof.
~ ~ (k 1) =, 2 a ( a + 2 ) = [s(r)- l][s(r)+ 31 12
as claimed. O
S(9r) = s2(r)S(r)+ 2 f i s ( r )
Proposition 5.5
If 1 := h*(r) and k := h*(16r), then
( i ) fi=
1-C - F
1+i c 7
5.2
(ii)
Calculation of a
a(l6r) = (1 + ~ ) ~ u (-r 4) f i y ( l + y
+ y2)
Proof. This may be derived similarly but is easily deduced from Proposition 5.2 and the quartic iteration of Exercise 3 of Section 1.4. 0
EXAMPLE
T-l.
E..
;
2
(T)
K)
k := ---v3-1
2fi
and
A*(n) = $G,~[V-d
m
A*(n)= $ [ d m - - V m ]
iii) A*(n) = g
m - gi].
162
TI
- *)(2
-f
i )l(8fi)
iv) g!,=.1/5+2,
klo=(fi+2)(3fi-I.6-2)
= (fi
- 3 ) ( v 2 - 11,
V) g:8 = 5 + 2 f i , k18= (5 + 2 f i ) ( 7 f i - 5 - 2 f i ) .
3. Use (5.2.6) with p := 3 to compute a(3).
4. Use (5.2.15) to obtain the following values of a (or 6).
i) s(~)=Vii) ~ ( 1 1 =
) [2xZ- (x - 3) +- Y
i
i
4
3
Note: G;," = x - 3 where x - x = 2.
~13 ~
(a
-1
.
-6
1) 12.
Use (4.6.8) and (5.2.6) to compute 6 ( 1 3 ) = ( 7 + 3 f i ) ~ ; , 6 where
G;:=
(a3)/2.
iii) a(15) =
5.
6. Show that Rp(l, k) = R,(kr, 1'), that Rp-~(k,I) = -p-'~,(l, k), and
that ( ~ ( p - ' )= - a ( p ) lp.
7. Use (5.2.14) to show that
) (810 + 3 0 6 f i ) f i 7314
ii) 2A*(49)A*'(49) = (1863 + 7 0 4 ~ 7= (7lI4 -
677)
l2
2
and
a(49) = - 6 7 [ f i 73'4(33011+ 12477fl) - 21(9567 + 3616fl)I .
8. Use Proposition 5.3 to calculate
5.2 Calculation of cv
where
= *k'
k2
A G ( ~ k')
,
AG(1,
a(p) -flk2
AG(1, k')'
1
flk'kZpi + [ a ( p) - *k2]
qi
where x, := l / k l , q, := x,, y, := 6 ,
and p, := xt/(x,
+ 1) while
Hint:Let q := e-4
Hence show that
b) Show that
and
165
lu
:! so that with r := K ( y ) ,
For k t = 1 we have L = f l K and (5.3.3) becomes
This is (5.3.1). Then (5.3.2) follows from (5.3.1) and Exercise 6 of Section
5.2. (See Exercise 1.)
For p = r, (5.3.1) reduces to
2 ~( 1,/ f i ,k )
g ( p 2 )=
MP( 1/ f i 7
k)
k := k(e-"')
kt"
l+k
+R p ( Z 7. k ) .
+ y',
while
$(k+ Yl)]
1-k
~ ( 2 ~ ) = ( 1 + k ) ~ ~ ( ~ , k ) + $ ( l + k ' ) .
166
while R3(y, k ) = l +
Similarly G k r 2= f i g y k , and we
+ l)]k. Thus
+ 2 f l + fi)
and
~~(6)=~~~*(6)(3-fi)=5fi+6V3-8fi-11.
Corollary 5.2
and
where
Proof. This follows from Theorem 5.4 and (4.7.9). (See Exercise
3.) 17
5.6. By piecing together various formulae many more values of a
can be obtained. We illustrate this as follows. Given 4 6 ) and h*(6) from
the previous example, we may use Proposition 5.3 to compute a ( $ ) . Then
the functional relation for a , equation (5.1.5), yields a($)= ( 5 a - 6 f l 8 f i + 11) 13 = 0.5138118 . . . . Similarly, given a(3) and h*(3) from Example 5.3, we can use Proposition 5.3 to find that a(:) = 66 + 47fi - 38fl2 7 6 = 0.5138837 . . . .
Indeed numerical computation of the maximum of a [using (5.1.10) and
newton,'^ method] shows that it occurs around 0.709 with a value of
approximately 0.514275. [Note that ( $ + :)I2 is very close to this point.] In
addition, one may observe graphically that a increases up to this value and
then decreases.
EXAMPLE
ff
rom the given formulae and appropriate singular values, many values
an be found in closed form. Some of the cleanest are given below.
Establish (5.3.2) of Theorem 5.4.
Use Corollary 5.1 and the given value of g, to compute a(n) or S(n).
i) g ! o = f i + 2 ,
,
ii) g 8 = ( + )
iii) g:, = (fi
+ I),
iv) gg8=(V%+5)/2,
a(10) = g~,h*(10)(3fi- 4)
a(18) = g;,h*(18)(21 - 6 a )
4 2 2 ) = g$h*(22)(33 - 1 7 f i )
a(58)=g~,h*(58)(99~-444).
'"
:
6
vi)
-6
g30
=6
+ 5\/i,
S(30) = g:oh*(30)[(56
+G
+ 38fi)
d o l
vi)
and
4 2 1 ) = 3 ~ , ; 6 ( l l +6 f i + 21/;j +
CT(?)=
a)
G y g 3 J T (5m-13V%+49fi-19)
168
vii)
u(93) = 6 G i ; ( 7 ) 3 ( 1 5 a
+l
3 m + 201fi
+ 217)
- 1/(2g;),
deduce that
="=
14 + 1 0 f i
33 - 172/2
and
"58
140 + 2 6 a
= 99.\/zis - 444
(a)
(m)
9.
5.4
Recursive Approximation to rr
Further such evaluations follow from Table 9.1, which gives K( h*(n))
in I' terms for 11 n I 16.
5.4 RECURSIVE APPROXIMATION TO rr
and
k,:=h*(r).
rn := Rp(kn7kn+l)
-.
E,
be as in (5.4.1).
170
Proof.
?r
Rewrite (5.4.2) as
5.7.
and
(5.4.6ii)
and
O<cw,-rr
-1
r16.4"fie
-drn.\Tr7r
Iteration 5.3 just peforms two steps of Iteration 5.2 as one, with some
mputational saving. The error bound given in each case is very accurate.
A Cubic Iteration 5.4
S,
mn
:= -
mn
s:+2s,-3
2
O<a, - r r - 1 s 1 6 . 3 n f i e - 3 n G " .
One can also provide a cubic iteration using k , instead of s,. This is
mewhat more inelegant and no easier to initialize. [See Exercise 3b).]
Large numbers of initializations for Iterations 5.1, 5.2, and 5.3 are
ailable in the previous sections. We collect some of the cleanest in Tables
information is given in Table 5.3. Recall that
ssary to verify these values lies in Section 4.7 and
e previous sections of this chapter.
When p := 7 we can write the iteration cleanly in terms of u, := k ,114 .
( 1 - unun+l)8= ( 1 - ~ f l ) ( ' - ' : + I )
2A*(N)A*'(N) = G , ' ~
a m
f f
(N)
v2-1
$
TABLE 5.3. Selected cubic invariants
174
~r
The updates for a(25r) and a(49r) are studied again in Section 9.5, where
solvable versions of the quintic and septic iterations are given.
n=O
175
Let
.- bn
Sn.-
tn :=
an
and
an+,:= S, an+ 7nv7(7- sn - tn)
2 1,
176
Then
an+':=5cnan 5"+'VF(dn
8
+ u,+'
- c,u,)8
satisfies
O < an- a-' < 1 6 - 5 " - \ / ~ e - ~ " ~ "
7.
for r5'" 2 1.
Show that, in Iteration 5.1, convergence is indeed pth order.
fi.
a) Show that
and
Verify that
where z,
G;:/fi,
and
14z3(1- 2,)
d R 3 -' f i t 2
where
c)
Compute that
4 2 2 ) = (V?
+ 1)~(33- 17fi)(3fi
- 7fi)(10 - 3 f l )
and
where x := g,,,
Verify
that g:,
c)
5.5
and y := g,.
+ g:
=fi
+ 1 and that
179
4,
Proposition 5.7
2 112
K1,4(h) = (1 + k
K(k)
if2hhf=[(g12+g-12)/2]-1a n d ~ r h r l l f i 0, < k s f i - 1 .
b) K,,,(h) = [I - (kk')2]114~(k)
if 2 h h f = J-11' and o s h r l l f i , 0 l s k s l l f i .
a)
c2.
Theorem 5.6
180
ls
2K
1 1
2k
(ii) - ?r( k ) = k r 2 F ( ; l ; - ( 1 )
0cksfi-1
2K
(iii) - ( k ) = k ' - 1 1 2 2 ~ ,
0sk2s2fi-2
k2
?r
( b ) (iv)
2K ( k )= (1 + k 2 ) - 1 1 2 2 ~ 1
?r
2K
( u ) - ( k ) = ( k r 2?r
(c) (vi)
2K
1 5
1 2 12
- ( k ) = [1 - ( k k r ) 2 ] - 1 1 4 2 ~,1-; 1; J 7r
1
fi'
'
Proof.
(a) We let s := 0 above to deduce ( i ) . Then (ii) follows on replacing q by
- q in the theta function representations of K and (2kk1l2.This is
Jacobi's imaginary transformation of Exercise 7d) in Section 3.2. We
derive (iii) from (ii) by replacing k by k , := (1 - k r ) l ( l+ k ' ) and
using the quadratic transformation K(k,) = [ ( l +k 1 ) / 2 ] K ( k ) of
Theorem 1.2.
( b ) (iv) comes from letting s := $ above. Then ( v ) again follows from
Jacobi7simaginary transformation.
(c) ( v i ) comes from letting s := 4 above. CI
Similarly,
Theorem 5.7
For k restricted as in Theorem 5.6
Note also that we may use (5.5.5) with s := 0 and Theorem 5.6 to
produce similar series for E. We are now ready to build our series. Recall
(5.1.4), which we write as
Thus given a ( N ) and A*(N), we can combine (5.5.10) with Theorem 5.7 to
produce series for 1/71..In like fashion we derive series for 1IK or for the
Gaussian AGM, M ( l , k t ) = n-/2K. In each case we have [(2Kln-)(k)12=
m(k)F(+(k)) for algebraic m and 4, while F(+) has a hypergeometric-type
7 4mF
~ ' + 4 m(/)p(+).Subpower series expansion C:=, an+". Then 4 ~ ~ 1 =
stitution in (5.5.10) leads to
182
Thus for rational N , the bracketed term is of the form a + nb with a and b
algebraic. We now specialize this for our invariants.
SERIES IN
where
a n ( N ):= [ a ( N ) - m k ; ]
SERIES IN
+ n m ( k h 2- k;)
gN: For N r 2,
where
SERIES IN
g4N = 2
114
g N G N : For N
$,
where
vm
c n ( N ):= [ a ( N ) - 1
k',] kh-l
SERIES IN X N :=
where
(gE +
2
g;12)-1
+ n m ( k &+ kh-')
- 4kNkh2
( 1 + k;)2
For N > 2 ,
'
184
?r
where
on(N) := [a(N) - m k i ] + n 2 m ( k h 2 - k i )
and for N r 2,
where
- "~TN
, it is very
which adds eight digits a term! Since k i behaves like 16e
easy to estimate the convergence rate in each series. For N at all large, the
rate while linear is most impressive. In the exercises we give various other
examples. Bailey [35, p. 961 gives (5.5.14) with N := 2 [equivalently (5.5.15)
with N : = # ] and ascribes this to Ramanujan. The series is
185
one has
Proof. Formula (126) in Goursat [I8811 and Propositions 5.6 and 5.7
combine to produce (5.5.26) and (5.5.27). 17
There is a corresponding formula for the larger solution. This implicit
formula for 1 in terms of k can be solved explicitly as follows (Exercise 19).
+ 1)/2*,
For h I(fi
where
and
a,
186
(This can be verified from formulae in Goursat [1881].) In other words, the
Nth singular value of K is sent to the Nth, (Nl2)th and (Nl3)th singular
values of K,,,, K,,,, and K,,,, respectively. Thus
and
while various singular values for K,,, are given in Exercise 19b). If we now
define a, by
(5.5.32)
a,(N)=-
Tr
COS ( r s )
4 ~ : 1+2s (
k:=A:(N)
where
b, (N) :=
~~~
1
cos (7rs)
while
The details are left as Exercise 20. Now, with some perseverance, we can
derive series including Ramanujan's missing formulae, which come with
s := 2 and N := 4 and 5 in (5.5.33) [See Exercise 20b) .]
Finally, we observe that the result of Exercise 22 in combination with the
discussion of I' values in Section 9.2 shows that the formulae for K,,,, K,,,,
and K,,, are in essence the only such formulae.
Comments and Exercises
5.5
187
5. Show that
6. Show that
7.
while
and
8. Show that
9. Show that
and
Show that
m
~ f i M ( 2 "2-I")
~
= ~r
n=O
Prove this by showing that both sides satisfy the same generalized
hypergeometric equation and are analytic at zero, with value 1 there.
Verify that
5 .
189
l+t l-t 3
- - . sin2x
2 .'2'
c)
t)n
2 (t)n((2x)2n
(2n)!
[This is also the limiting case of b). See also Exercise 16 of Section
11.3.1
d) Find similar formulae for sinh [t log (x + l/l+;;i)] and for
log (x +
e) Establish that
m).
and that
f) Prove that
sin ( r t )
gt
-=
a)
Prove that
.=o
(t)n(-t)n
(n!)'
'
190
b) Show that the previous integral is Y(3) 110. (See Exercises 12d)
and 12e) of Section 11.3.)
18. Let G,(x) := ,F,(a, (2n
+ 1)/2; 2 a + 1; x).
i)
~ ? = 2
ii)
g:
iii)
G?
iv)
vi)
gives
= (fi1)'
+f l ) '
H? = 1
gives H? = 2
gives H
& = f i ( V 3 + 1)'
gives H?
g = ( + ) 4
gives H?=
= (2
2(2 + v3)2
3fl
27
( l4fi3&13fi)
Hint: For iii) and vi) use the increasing form of (5.5.28). which
gives H,, in terms of G,. This entails changing th; central sign in
(5.5.28ii).
20. a) Establish the general formula for .rr-' given by (5.5.33) and
(5.3.34).
b) Verify the following values of a,,,(N) + nb,,,(N):
i) N : = 2 gives (6n+1)13fl
ii) N := 4 gives (60n + 8) 127
iii) N := 5 gives (66n + 8) / l 5 f i .
c)
I
5.6
OTHER APPROXZMATZONS
v y
'
-\"
vy"
so that
192
with error 0 ( f l e - + ) ,
(5.6'6)
and
3
rr2(p):= Vy - [ a ( p ) ( 4 G Z 4- 1 ) ] / [ ( 1- 2k2)(8GZ4+ 1)]
3(3 + v3)
(5.6.7) ~ ~ ( 2=5 )
5
and
~ ' ( 2 5=)
63(17 + 1 5 a )
25(7 + 1 5 G )
84
~ ~ ( 3=7 )
2 1 f i - 101
(5.6.8)
and
~ ~ ( 3=7147
)
1 4 5 m + 1134
2 2 3 9 9 m - 41916
(The values of a and a are not quadratic surds.) For even p it is better to
use
(5.6.11i)
fllog ( 8G
E)
log ( 5 )
xN
1
-log (1728J,)
2 f l
m,+
-4
a. fi )
5
12
( 4kt2
-k '
x := 1 ( ~ ) 2= 4
n
m
=q
n=l
(1 + qn)24
and there is a similar expression for (kkt/4)'. We may expand this product as
a power series and compute as many terms as we wish of its reversion. This
will produce a series of the form
= x - 24x2
$i
+ 852x3- 35744x4+ . . .
log x + f l=~
24
log (1 + qn).=24
---n=l
k=l
k
(5.6.16)
which may be expanded as a power series in q. When we substitute (5.6.15)
into (5.6.16) we will recursively compute
194
Each series adds roughly T<N logl,e digits a term. Hence when we use g,,,,
given in (4.7.14), we gain more than 90 digits a term! But of course we have
to compute the logarithm as well. Thus in our context the formula should be
viewed as a rapid series for log (1x1) + m r r , not as a computation of rr.
Comments and Exercises
Approximations (5.6.5) and (5.6.6) were the reason why Ramanujan computed R,. He did not give (5.6.9). Many very large invariants, including
G141ss,G,,,,,, and G2,1,s are derived in Shanks [82].
1. a)
Establish that (5.6.5) and (5i6.6) have the claimed errors without
using the rather deep formulae (3.2.16) to (3.2.18).
b) Show, using (3.2.16) to (3.2.18), that
and
2.
3 ( 3 a + 7)
17
and
n;(13) =
1 0 3 a + 125
158
c)
66a
3 3 m - 148
and
q(22) =
6.\/zz
33 - 1 7 a
error
'
195
and
4
log (842/2) .
rn
a) Show that
and
64gZ4= q - l - 24 + 2764 - 2048q2 + - .
with similar expressions for GZ4.
b) Thus
and
6 4 ( ~ ; + ( 3 ~ ' ~-) 24 = elTG + 4372e-lTm + . - . .
c) When g, or G, is a quadratic surd, this gives an expression for the
integer part of era (and the proximate 0's or 9's). Thus the
integer part of elTm is 2,508,951. That of elTm is 199,148,647, and
that of elTm is 24,591,257,751.
a) Show as discovered by Beukers that, with {a,) as in (5.6.17),
196
where the error in ii) is about 2-'t3 and the error in iii) is about
3 .2-l7tS.This is pleasantly developed in Almqvist and Berndt [Pr]
and in Berndt [Pr]. Truncating the approximation in iii) leads to
iv) p - r ( a + b)[l+ QtI2
with an error about 2-'t3. This is due to Nyvoll [78].
a) Prove i).
b) Justify the error estimates in ii), iii), and iv).
One may avoid the Landen transform in a) by following Ivory
[1796]. We write
gives
2?ra(5fi
- 6 f i -7fi
+ 9)
l~a[3(2+ f i ) 6 ( 2 f i - 2 f i - 1)
b) Let x := ( ~ , / 2 SO
) ~ that x = [ $(1 -
e)
/ ( l + *)I4
and
..
An(% P ) + L ( P 7 a )
2
henever /3 > a > 0 and
AG(B,
v m )
=1
198
and
and
Show that
h) Show that
Chapter Six
The Complexity of
Algebraic Functions
Abstract. The aim of this chapter is to analyze the complexity of algebraic
functions in general; and of multiplication, division, and root extraction in
particular. There are two primary tools, Newton's method and the fast
Fourier transform.
6.1
COMPLEXITY CONCERNS
It is obviously inappropriate to consider the multiplication of two manythousand-digit numbers to be of equal difficulty to the multiplication of two
single-digit numbers. A reasonable measure of complexity that takes this
into account is the bit complexity. The bit complexity of an algorithm is the
number of single-digit operations required to terminate the algorithm.
singfee-digitoperations include addition, multiplication, logical comparison,
and storage and retrieval of single-digit numbers. We are exclusively interested in how the complexity increases with the size of the problem. For
example, addition of two n-digit integers by the usual algorithm has bit
complexity O,(n)-the
subscript B on the order symbol is for emphasis.
This is a serial notion of complexity in the sense that on a serial machine it is
an appropriate asymptotic measure of the time required for the calculation.
We use the slightly nonstandard notation
a, = O(bn)
and
b, = O(a,) .
--
201
= fl(b,), we say that {a,) and {b,) are equivalent. Since accessing an
it number requires fl(n) bit operations, it is apparent that "usual"
ion of two n-digit integers is in fact fl(n). These trivial lower bounds
e a consequence of uniqueness considerations-if we change any digit of
of the numbers being added, we change the answer. Thus any algorithm
additioli must at least "inspect" every digit. So, up to a constant, usual
diton is asymptotically optimal. As we shall see later, one of the interestg consequences of this body of theory is that "usual" multiplication is far
om asymptotically optimal.
A detailed approach to complexity requires a model of computation and
perhaps most readily made rigorous in an analysis of Turing machines.
ee, for example, Aho, Hopcroft, and Ullman [74].) This much detail is
necessary for our purposes.
We will usually content ourselves with merely counting single-digit addins and multiplications. In all the algorithms we present the comparisons
ote that the comparison of two n-digit numbers is LR,(n)] and the storage
ncerns will be bounded by the arithmetic operations-provided the alrithms are sensibly implemented. This is almost always transparent and
11 rarely even elicit comment.
Operational complexity counts the number of operations (addition, multiication, division, and extraction of kth roots performed to a precision
ded by the precision of the output). When all the operations in an
ithm are performed to roughly the same precision, this is a useful
easure. The reasons for the particular choice of operations will be made
parent in Section 6.4. Thus the algorithms of Chapter 5 compute n digits
T with operational complexity O,,(log n); once again the subscript on the
der svmb01 is for em~hasis.
of the primary tools for the analysis of algorithms is the use of recursive
tions. The idea is to divide a problem into smaller subproblems that can
e solved by essentially the same technique and then recurse, a strategy
often called "divide and conquer." The reader unfamiliar with this approach
might like to examine the exercises. One of the lessons of complexity theory
is that many of the usual algorithms of mathematics are far from optimal.
Multiplication, taking Fourier transforms, and matrix multiplication are
but three examples. (See Exercise 3 and the next section.) A second lesson
is that good lower bounds are very difficult to obtain. For the analytic
algorithms we are considering, the only lower bounds we can establish are
the trivial ones. Thus unless the algorithm is of the same order, as is the case
for addition, we cannot achieve exact results,
We shall not discuss combinatorial complexity except to mention that an
introduction to this well-developed and important field may be found in
Aho, Hopcroft, and Ullman [74].
202
1. Prove the following. Let a, b > 0 and c > 1. Suppose that f is monotone
on (0, 00).
b) If f(n)
Iaf(nla)
+ bn(1og n)"-'
ifa<c
if a = c
ifa>c
Show that
and
203
The final term comes from using usual matrix addition for the 18
additions. The constant a can be chosen independent of n and can
be used to include the "overhead" of actually breaking the problem
up. Use the above inequality to show that
4.
D-I
(One can show that in general matrix inversion and matrix multiplication are asymptotically equivalent. See Aho, Hopcroft, and U1lman [74].)
5.
204
6.2
These are the two directions of the finite or discrete Fourier transform.
The classical approaches to either part of the Fourier transform problem
have operational complexity at least cn2. This is the operational complexity,
for example, of evaluating p, at n + 1 points using Horner's rule [writing
p,(x) = (((anx+ a,-,)x + a,-,)x + . - -)I. We wish to prove that, in fact,
both directions can be solved with complexity Oo,(n log n). Actually, we
only treat the case n + 1:= c2", which is sufficient for our purposes and
somewhat simpler.
Theorem 6.1 (Fast Fourier Transform)
Let
and
xr(x2) := x(al + a,x2 + . - . + a,xn-l)
Then with y := x2,
205
+dy)
( w ) -(w
(n+1)12+i 2
+ 2 - 2"
F(1) = 0 .
- rn
F(2") = 2"+'
(; :5 ;:)
(
:-:)
1
w :=
1::
wn
(6.2.5)
...
W-' = n i l
...
wZn
wn2
1
1
1
1
-1
w
w-2
w -4
...
W-,,
w-~"
...
-2
* * .
w-2
and w-I is also a primitive (n + 1)th root of unity. (See Exercise 4.) The
interPolatioh problem can be written as: Find (a,, . . . , a,) so that
W(ao, . . . ,a,,) = (a,,
. . . ,a,,).
206
= (a,,
. . . , a,)
i = 1 , . . . ,2n.
p(wi)q(wi)
Step 3: Solve the interpolation problem for pq to find the
coefficients.
Show, using an FFT, that the above algorithm has operational
complexity
* Strictly speaking, we should be using (2")th roots of unity where we have established an
EFT. This can be arranged by padding with leading zero terms, if necessary, without changing
the order of complexity.
207
and so
Establish that
[Note that we may assume q(0) # 0.1 Now the computation of si+ := 2si s:q can be performed using an FIT-based polynomial multiplication and
need only be performed using the first 2j - 1 coefficients of q and si. By
starting with an appropriate first estimate of s; [say, so(x) := 1Iq(O)] and
proceeding inductively as above (doubling the number of coefficients
utilized at each stage), show that the required number of terms of the
expansion can be calculated in O,,(n log n). (See Section 6.4.)
208
3.
.ij={","
i=o
5.
j=Omod(n+l)
otherwise .
6.
si:=
and con6ruct the sequence
and
x+x2+x3+x6+x12+~13+x26+x27.
a) Prove that the above method computes xn and observe that it only
requires storing x, n, and one partial product.
b) Show that the number of multiplications is less than 2 [log, n] .
c) Show that the above method is optimal for the computation of xZm
(considering only multiplications).
d) Show that the S-and-X method is not optimal for computing xl?
An extended discussion of this interesting and old problem is presented
P(x) := bnxn-l
+ bn-lxn-2 + - .. + bl .
a = a(10)
and
/3 = /3(10) .
Y(X):= a(x)P(x) .
210
theory) there are plenty of FFT analogues in a finite setting, one may, if one
prefers, consider the entire algorithm performed mod (O(n)).
Step 1: . Evaluate a(x) and p(x) to precision O(1og n) at the 2n points
wl, w2, . . . ,wZnwith w a primitive (2n)th root of unity. Using the FFT, this
has operational complexity
O~(nT(logn)) -
OB@>.
This step essentially requires only addition. The coefficients of y are closely
related to the digits of ap except that they may be too large and a "carry"
must be performed.
The total complexity thus satisfies
T(n) = OB(n(logn)2(log(logn))2 . . .)
with the product terminating when the iterated log is less than 1.
This is not optimal. (See Exercise 2.) The same analysis as above using a
base n representation of a and 3./ reduces the time for multiplication to
OB(n(log n)(log log n) . . .) .
This is still not quite the asymptotically fastest known algorithm. The best
bound is due to Schonhage and Strassen [71] and is
O,(n(log n)(log 1% n)) .
We shall call any multiplication that performs with this speed a SchonhageStrassen multiplication. It should be emphasized that in all the reduced
complexity multiplications we present, the order estimates include the
overhead additions (and storage concerns).
algorithm; details
213
and
M(n) is nondecreasing .
(6.4.2)
Since two multiplications of length n can be viewed as subproducts of a
single multiplication of length 2n, while four multiplications of length n
comprise one of length 2n, the first part of the assumption is reasonable.
Since multiplications of length n can be padded with leading zeros to
multiplications of length n + k, the second part is also reasonable. Of course
it is easy to imagine a perversely designed multiplication for which (6.4.2)
does not hold.
Let D(n) and R(n), respectively, denote the bit complexity of division
and extraction of square roots, where the input and output are to precision
n. We say that two operations are equivalent if the complexity of one is
bounded by the complexity of the other and conversely. For example, we
say multiplication and division are equivalent if, given a multiplication with
bit complexity M(n), we can construct a division with bit complexity
D(n) = O(M(n)); and conversely, given a division we can so construct a
multiplication. The following remarkable theorem, the first part of which is
due to Cook, may be found in Brent [76c].
Theorem 6.3
214
Proof.
(a)
ab = llb '
(c)
6.4
Newton's Method
This satisfies
is Exercise 7.
It is apparent that any time a function may be quadratically computed by
Newton's method from an iteration involving only addition, multiplication,
and division, that function will be of complexity O,(M(n)). This applies to
any algebraic function over Q(x), that is, any function f satisfying an
equation
(6.4.8)
@(x, f(x)) = 0
Iff is algebraic over Q(x), then the complexity of calculating n digits of f(x)
is O,(M(n)).
The preceding results, of course, assume that we are avoiding the branch
points of the function in question.
Comments and Exercises
Newton's method has a host of refinements and variants. See, for example,
Householder [70] and Exercise 3. The iteration for square roots can in some
form be traced back to the Babylonians, who used one or two steps of the
method.
It should be observed that not only is D(n) = O,(M(n)), but the constant
concealed by the order sign is fairly small. [From (6.4.5) we see that a
constant 8 works. Indeed, for all known multiplications, the additions term
is negligible and a constant of 4 is appropriate.] This is also the case for root
216
extraction. See Brent [76c], where a number of constants for these and
various other equivalences, as in Exercise 7, are established.
Further discussion of the calculation of algebraic functions may be found
in Kung and Traub [78]. Related matters may also be pursued in Lipson
convergence. For which real starting values does the method converge?
3. a) Consider the iteration
:= x k ( l + (1 - YX,)
+ (1 - y ~ k ) 2 )
Show that
6.4
Newton's Method
c) Let
x,+, := Qxk(15- 10xky+ 3y x,) .
2
2 4
Show that
be the iteration (6.4.3) for computing llx. Show that x,,, is the
(2,+' - 1)th Taylor polynomial of f(x) := 1lx expanded around the
point 1. Thus for division we may think of Newton's method as a
means of accelerating the computation of the Taylor series.
Let x, := 1 and let
be the iteration (6.4.6) for computing fi.Show thqt x,+, is,a rational
function r(x) with numerator of degree 2k and denchinator of degree
2, - 1 that satisfies
218
b) Write down Newton's method for computing exp from log and log
from exp. These particularly simple iterations combine with a) to
show that the problems of calculating exp and log are effectively
equivalent, from a bit complexity viewpoint.
9.
Hint: Break the number to be converted in half (base k), convert each
half, and recombine. (See Knuth [81] for a lengthy discussion of this
problem.)
10. a) Show that n! can be calculated with bit complexity
Chapter Seven
7.1
~r
AND
LOG
Proof. Both Algorithms 2.1 and 2.2 as well as most of those of Chapter
5 perform with the above complexity.
220
(:)I=
(71.1)
I K ' ( ~ )-log
(7.1.2)
(7.1.3)
(7.1.4)
I E ' ( ~ )- 111101k2log k (
Kr(k) -log
():I
5 101k2log
kl
k E (0,
k E (0,
where
and
7.1
lr
and Log
77
ZAG(1,
+ p:)/2]2]
222
and hence
7.1
l~
and Log
A,+, := ;(A,
+ B,)
+, 4
Hint: Imitate the second proof of Theorem 1.1and use the fact that
A, and B, commute.
e) Let
224
We recall that
The algorithm's complexity is determined by the series expansions employed in steps 1 and 2. These "sparse" series yield n-digit accuracy after
.\/iinonzero terms. While the asymptotic complexity is far from optimal, the
algorithm has the advantage of not requiring very small starting values for
the AGM iteration (Step 3). Also, only a single AGM iteration is required.
Sasaki and Kanada [82], who proposed and analyzed the above algorithm,
show that it out performs the methods of Section 7.1 for numbers in the
225
(i)
= log
($)
for various m. This speeds up the series calculation at the expense of the
AGM. For n-digit precision, using m = n effectively reduces this algorithm
to Algorithm 7.1. (See Exercise 1.)
For certain choices of q we get reduction in complexity. If q is any small
integer, then the series expansions are particularly easy to evaluate. (See
Exercise 2.) This leads to a very fast algorithm for rrllog 10 (using base 10
arithmetic) as follows. From (7.2.1), (7.2.2), and (7.2.3) we have
Now for q := 1/ lo4 both of the above series are just sequences of 0's and 1's
and the starting values for the mean iteration can be calculated very quickly
[O,(M(n))]. The remainder of the work involves calculating a single AGM.
Similarly rrllog p is amenable to very fast computation in base p.
Properly interpreted, (7.2.6) remains valid for matrices and provides a
theta-based computation of the matrix logarithm of a positive definite
matrix. (See Exercise 3.)
2.
226
3.
Proof. For our purposes the elementary functions are the rational
functions, log and exp, and any function that can be formed from these
functions by a finite number of compositions, multiplications, additions, and
solutions of algebraic equations. (See Davenport [81] or Ritt [48].) The
point of the proof is that such an f is constructed from Q(x) by taking a finite
number of exponentials, logarithms, and solutions of algebraic equations in
these quantities. The number of algebraic equations to be solved determines
the constant s. As in Chapter 6, solution of the algebraic equation in
question can be effected in a time proportional to the complexity of
evaluating the equation.
A number of comments are in order. First, we can formulate the above
theorem for f algebraic over R(x) if we assume that the requisite real
numbers are given. Second, while multiple solutions of an algebraic equation pose no theoretical problem, in practice, determining the "correct root"
The algorithms for exp (see also Exercises 1and 2) require inversion and are
much less satisfactory than the algorithms for log. A direct OB((logn ) 2 ~ ( n ) )
algorithm is presented in Chapter 10. There are a number of issues that
remain unresolved in this discussion. The most obvious is a discussion of
lower bounds. Observe that, by Theorem 6.4, showing that exp does not
have complexity OB(M(n)) would show that exp is a transcendental function. Likewise, showing that T does not have bit complexity OB(M(n))
would imply the transcendence of T.The gap between the known operational complexities for log [O,,(log n)] and exp [O,,((log n)')] is probably specious, but this also is not known. We will show in Section 8.8 that direct
algorithms for exp and log of the type that we derived for K in Chapter 1
cannot exist. There are no quadratically convergent fixed iterations for these
functions.
1. From (2.3.7) and Exercise 3 of Section 2.5,
2.
228
1- k:,
k,+l := 1+ k:,
and
tan(c#+,+,-&)==k,',tan&
a) Show that
F(+o, k) = c$o
+ 0(k2)
k -+ 0 .
as
b) Show that
+ O(1-
k)
as
[E(?)I
log tan
1+ k:,
[ n (T)]
:(
t)+
n-1
tan-' w, =
m=O
[E(T)]+
1 k:,
log tan
~ ( k :+ (1 - ko))
($ + tan;
1% 6 +
w ~ )
w',+ (1 - ko))
229
d m+
where 6 :=
w,. Show, by inverting c), that tan-' can be
calculated from log with complexity O,(log n M(n)). Show that, by
inverting the above, tan can be calculated from log with complexity
O,(log n M(nN
This approach, which avoids using complex arithmetic to access
the trigonometric functions, is essentially due to Brent [76a].
Let L(n) denote the bit complexity for evaluating n digits of log. Show
that
D(n>= O,(L(n))
and hence, log and exp are at least as complex as multiplication.
Hint: Consider computing the derivative of log.
Chapter Eight
if and only if a = b.
A mean is homogeneous if
(8.1.3)
for a, b, A>0.
) A mean is symmetric if
(8.1.4)
M(a, b) = M(b, a)
for a, b > 0.
A mean is (strictly) isotone if, for a, b > 0,
(8.1.5)
M(a, .)
and
M(., b)
is a (strict) mean.
Corresponding to each mean M we associate another mean M,, defined
by
232
or
E R X let
Then H, is just the arithmetic mean A , and thus H, is a strict, homogeneous, symmetric, isotone mean. Moreover, limp,, H,(a, b) = flis the
geometric mean G and may reasonably be denoted by Ho. (See Exercise
14.) Since H-, IHo IH,, Proposition 8.l(e) shows H, to be a strict mean.
For all p one can unambiguously define H, for a, b 2 0. Then the trace of H,
satisfies
t H p + )=
Note also that (H,)
-,= H-,
[2- , I p , 00)
[O, 2-lIP)
p r o
P<O
For p E R we let
e. 1
r n U Y I . LICI
I..=L...U
"dd
the onlv means common to the Lehmer and Holder classes. Again
- there is
difficulty extending Lp to a, b r 0 and
{I
and
tLp(a)= 2
P>O
;::
= (l-,)(Ll)(a,
b) = L(,-,)@, b).
. Consider f = L~
where q := s - r. Then
[=I
as
8.1.14)
f -l~,,,-,(f(a),
f(b)) =
bs
I/("-r)
=: G,,,(a, b)
r(a, b) := f-I[
S.bf(.)d~]
b -a
234
(8.1.16i)
with
(8.1.16ii)
b-a
=: 2 ( a , b).
log b - log a
and
(8.1.16iii)
The mean 2
' is the logarithmic mean and 4 is the identric mean. These
means have
{ i-I
-l/(p-1)
t s p >=
p>0, p f 1
p=l
p r o
and
tsp(a) = a .
Also
(Composition)
If M is defined by
where Mi, i = 0,1,2, are means, then M is a mean. If two of M,, MI, M, are
strict, so is M. If all three are homogeneous, symmetric, or isotone, then so
is M.
i-
235
a)
@(a) dM(a' b,
da
+ @(b) dM2pd b,
2 @(M(a, b))
Use a), b), and the convexity of @ to show that Hp is increasing for
p in [W.
d) If M is homogeneous and symmetric and if Mp is increasing for
p > 0, then Mp is increasing for p in R, whenever M,, exists.
c)
236
p r l
psl
p50.
lim M , ( a , b ) = a ~ b
p-+ -m
and
limMp(a,b)=avb.
P-=
p+-m
and
lim M(a, b) = a v b
p-tm P
c) Show that
237
S-r
2 log
$,(a, 1)
xZ1.
238
a) Then
and
(2.
-
Then
'
c) Let q(x):=
x > 1, x rational
otherwise
(icka
k=O
c,=-O,
&,=I.
k=O
239
M ( @ ( a ) , @ ( b ) ) = @ ( N ( a , b ) ) a,b>O
and we write M >, N or M > N. If M and N dominate each other, we call
M and N equivalent. Since domination is transitive, this is an equivalence
relation which we write -. Unfortunately most of our results demand that
we consider more restrictive notions of equivalence, so we require that @ be
one to one from now on. ~ e n c @
e is monotone.
Theorem 8.1
Suppose that M and N are two means with M >, N .
M a , b)=g ; l ~ ( g t ( a ) ,
@=mP+p
(c)
t
t
:
J
cu>O,p#O
and N = H,, p # O .
Suppose that M := G. Then either
and N = H,, P E R .
It
follows that the only homogeneous means equivalent to A are the
(d)
Holder p means ( H p ,p # 0 ) and the only homogeneous mean equivalent to G is G.
Proof
240
(8.2.2)
= g,(a)
2g(?)
+ g,(b)
a, b E rng (@)
(8.2.3)
x E rng (a)
(c)
+ b(t) and
241
Proposition 8.5
In the next lemma we give several conditions for (8.2.8) to have only the
trivial solutions (normalized). Under these conditions we have g, = c(t) L and
Then Exercise lb) shows h = cup, a > 0, p # 0, and it follows that the only
homogeneous means strongly equivalent to M are Mp, p # 0. Thus we have
determined the strong equivalence class of M in the cases covered by the
next result.
Lemma 8.1
Suppose M is a homogeneous strict mean.
(a)
The following two conditions imply that (8.2.8) only has the trivial
solution when h,(l) = 1.
(i) M is ultimately monotone and
(ii) h,(ii) =. ii for some 2 # 1, ii > 0.
242
:
(b)
Proof.
Given that i and 1 are distinct positive fixed points of h,, it follows
from (8.2.8) and continuity that {c > Olh,(c-) = c) is a closed interval
C. We show that s := sup C is a. If not we argue as follows.
For any c < s in C, M(c, s) < s. Hence M(c, s + E)< s for some
E > 0. But M(s, s + E) > S. Thus M(c, s + E) = s for some % < s, c in C,
E > O . Then for n = 1 , 2 , 3 , . . . ,
s=h:(s) = M(h:(c), h:(s
s)).
Theorem 8.1 can be found in Wimp [84] in slightly different form. This is
partly explained by the fact that exp[(logx + log y)/2] = G(x, y). For
Hardy, Littlewood, ,and Polya [59] or Wimp [84] this means that A >,,, G.
243
*(tx> =
44x1 + ~ ( t )
for some c(t) in R. Then $(t) = c(t) and thus [a(x) - 11l$(x) =
[$(tx) - $(t) - $(x)]l$(t)$(x) is independent of x if $(x) # 0. Thus
$(xy) = c$(x)$(y) + $(x) + $( y). If c = 0, then $(x) = C log x,
while if c # 0, h(x) := c$(x) + 1 solves h(xy) = h(x) A( y) whose
.
(8.2.4) holds.
solution is L ~ Thus
Show that (8.2.6) has solutions only of form (8.2.7). Hint:Consider
h(x) := log [@(x)]- log [@(I)]. Then h(xt) = a(t)A(x) + h(t). Now
proceed much as in b).
By considering M-, show that if M >, N, M strict and homogeneous, h isotone with t,(m) < m , then t,(m) < w.
Complete the proof of Lemma 8.l(a).
Complete the proof of Lemma 8.l(b).
Verify the final claims of this section.
Calculate the strong equivalence classes within G,,, and E,,s,
If M is a symmetric, homogeneous, strict mean, then either tM(0)=
0 or tM(m)= 03.
Similarly, suppose then that M >, N for some homogeneous N with
h monotone. Show that 0 or OJ lies in rng(h).
8.3 COMPOUND MEANS
We now formalize the notion of a mean iteration. The Gaussian AGM and
Archimedes' method provide the central examples. Let M and N be any two
continuous means. Let a > 0 and b > 0 be given and consider the iteration
ao:=a
bo:=b
a,+, := M a n , b,)
b,+* := N a n , bn)
Under mild hypotheses, Theorem 8.2 shows that the iterates converge to
a common limit, which we call the compound of M and N and denote by
"
244
M @ N(a, b). We will also denote the entire iterative process by [M, N]. We
say that M is comparable to N if one of the following holds:
(a) M ( a , b ) z N ( a , b )
(b) M(a, b) 5 N(a, b)
for
for
a,b>O
a, b > 0
(c)
for
O<a<b
for
0 <b <a
M(a,b)sN(a,b)
and
N(a, b) 5 M(a, b)
When M and N are symmetric, c) cannot occur, and we say M and N are
comparable. In the nonsymmetric case c) can occur, and then M is comparable to N but not conversely. In the next result comparability is only needed
to establish monotonicity of the iterates. (See Exercise 1 and Theorem.8.8
of Section 8.7.)
Theorem 8.2
Let M and N be means with M comparable to N.
(a) Suppose that M or N is strict. Then .EM, N] converges and M @ N is a
mean which is strict if both M and N are.
(b) M @ N is a homogeneous, symmetric, or isotone if each of M and N is.
(c) M @ N is continuous, and the convergence is monotone and uniform
on compact subsets of {(a, b)la, b > 0).
Proof
(a) Suppose that a > b and that M r N. Then a, = M(a, b) r N(a, b) = b,.
Inductively suppose that a, r b,. Then
(8.3.2)
and {a,) decreases while {b,) increases. Since each bounds the other,
both sequences converge, say, to x and y, respectively. By continuity
we have x = M(x, y) and y = N(x, y). Since M or N is strict, x = y. If
a s b, we may have to exchange the roles of a, and b,. Thus M @ N
exists and satisfies
(8.3.3)
This finishes (a).
MANsM@N~MvN.
245
The sequences {a,) and {b,) are in fact built by repeated composition
of means. Thus a, = Mn(a, b) and bn = Nn(a, b) for means Mn and N, .
These means are symmetric, homogeneous, or isotone when M and N
are, and so is the limit mean M @ N.
.
M,
Let a, b > O and E > 0 be given. Pick n so that la, - b,l < ~ 1 2Now
and N, are continuous. So we may find S > O with IM,(a, b) M,(ar, br)l < & / 2 and INn(a, b) - N,(ar, br)l < ~ / if2 lar - a [ < S and
Ibr - bl < 6, (ar, br >O). Again assume a > b and M 2 N. Then
M @ N(al, b')
5 Mn(ar,b r ) 5 M,(a,
b)
b) + E
and
n-m
Thus
@(a, b) = @(M@ N(a, b), M @ N(a, b))
246
Observe that we need not verify that @ is a mean, but only that
@(x, x) = x for x > 0 and that @ is a continuous solution of (8.3.4).
= G.
(b) Let M(a, b) := 9ab2/(a + 2b)2 and N(a, b) := (a + 2b) 13. Then
LetH,@H,=AG:[email protected],
( %)
A @ G ( a , b) = a A G 1, - =
an2K1(b/a)
@(a, b) :=
an2K1(bla)
Definition 8.2
Let M and N be symmetric means.
(a) Suppose M and N are comparable. Then we write
M@,N:=M@N
and
[M,N],:=[M,N].
Proposition 8.6
~ ( ab):=
, M(a
b,a v b).
248
2.
Hint: {a, v b,) and {a, A b,) are monotone sequences, and so
a, v b, decreases to x and a, A b, increases to y. One may suppose
x = (M v N)(x, y). Thus x = y by Proposition 8.4 and M @ N exists. Since M A N 5 M @ N r M v N, the limit is a strict mean.
b) M @ N is symmetric or homogeneous when M and N are.
a) Let M be a strict mean. Then
b) Show that
c)
M@N(a, b ) =
and
for a + b .
._ a,
an+' '-
+a
and
2
b,,+ := b,
+ 2l k K
'
249
4.
Prove Proposition 8.6 from Theorem 8.3. Note that in this setting
M I N * for O < a < b and M r N * for O < b < a .
5.
ii)
G@,G(a, b ) = a 113b213
iii) H P @ , H P ( a , b ) = (
a P -t2bP
'IP
pfO.
6 . For any function Q let &(a, b) := Q(b, a). Prove that for any means M
and N,
and
M @, N(b, N(b, a)) = N @, d ( a , b)
8.4
In this section we show that Gaussian iterations typically converge quadratically and Archimedean iterations sublinearly. We also give some more
examples of Gaussian and Archimedean iterations for which we can calculate
the limit.
(a) Let a, denote the area of a regular m -2"-gon inscribed in a unit circle.
Let b, denote the area of the circumscribed regular m .2"-gon. It is
and b,,, = 2a,+,b,l(a,+, + b,) while
easily verified that a,,, =
.
we geometrically vera, = $m sin (2nlm) and b, = m tan ( ~ l m )Thus
ify that a, and b, tend to T. This gives G O , H-,(a,, b,) = n or, on
using homogeneity and replacing 2 r l m by 0,
(8.4.1)
(!))
=0
(8.4.2)
+,
(w)= arccos):(
a, < bn
(w)
arccosh):(
bn +
a, > bn
2 arccos
bn+l
(8.4.4)
2 arccosh
Since an+llb,+l= d(a,lb, t- 1) 12, this is just the half-angle formula for cos
or cosh. El
We now turn to study rates of convergence for Gaussian and Archimidean means.
Theorem 8.5
251
(b) Consider the Gaussian iteration [M, N], for comparable M and N.
(i) Then, if a, # b,,
(8.4.6)
lim
n-+-
an,,
a,
- bn+l = 0 .
- bn
lim
Ian+l-
la,
',+I1
- b,I2
Thus
Thus
ancl - b,+, = o(a, - s) + o(b, - s) = o(a, - b,)
(since s lies between a, and b,). This gives (b).
Given
one more derivative, we have
(ii)
and a similar formula for b,. Subtraction gives (ii). Here we have
[Exercise
used V'M(S, s)(a, - s, b, - s)' = M,,(s, s)(a, - b,)'.
7d).]
Considerably more can be said if M and N are two or three times
continuously differentiable. (See Foster and Phillips [84b].) In addition the
convergence in each case is uniform. For our purposes Theorem 8.5 suffices.
In particular, Archimedean iterations characteristically converge linearly
and Gaussian iterations super linearly (quadratically if twice differentiable).
In light of Jacobi-type methods in the theory of equations, this may seem
counterintuitive. Exercise 3c) shows that (8.4.6) may fail if M and N are not
differentiable. The theorem justifies our separating Gaussian and Archimedean iterations. While symmetry is central to our convergence arguments, it
is not always essential, all that is really needed in (a) and (b)(i) Theorem
8.5 is that the two means have the same gradients on the diagonal. (See
Exercise 11.) Also, (8.4.7) shows that better than quadratic convergence is
possible only if M,, = N,,. (See Exercise 7.) Additional information is given
in Exercise 11.
Finally, let us say that two iterations are equivalent ([M, N]
[M', N'])
M' and N
N'. Strong equivalence is similarly defined. Clearly
if M
equivalence of iterations implies equivalence of the limits, but not conversely. Indeed, if the mapping h is continuously bi-differentiable, the rates of
convergence must be the same. (See Exercise 8.)
-,
-,
-,
a) Show that (8.4.1) and (8.4.2) are consistent with the general
formula of Exercise 5a) of Section 8.3.
b) Show directly that in both (a) and (b) of Example 8.2 we have
$ (a, - b,). This illustrates why computation of any
a,,, - b,,
large number of digits of rr by this method is impractical.
,-
c)
d,,, -limd,
d, - lim d,
2.
a)
+ 2bn)/3 =: d,
satisfies
v1- x2
c)
Osx<l.
The next exercise shows a class of means which is closed under the
Gaussian product. Contrast this with the Gaussian product of Holder
means.
3. Let Q,(a, b) := t(a v b)
+ (1 - t)(a A b) for 0 5 t 5 1.
o,
4.
o,
.\I-,
then
Explicitly,
a n + ,:= d a :
+ b i - anbn
and
bn+,
:=a
.- a: + bi + 6anbn
an+l * -
4(an + bn)
cbnverges to (a,
and
bn:=
3a:
+ 30; + 2anb,
4(an + bn) W,
+ b,) 12.
and
+-a
2
Ilk
and N(a,b):=
255
' 4
8
n-
41
a)
where m : = a ~ b E ,: = a v b, and d : = E - m .
256
x-m
m
(1) =--====
(2)
m-x
m
x-m
m
(3) =-== =
(4)
m-x
m
x-rn
x
(5) =r-==
=(6)
m-x
112
E-m
m
(7) -z-==
=m-x
E
(8)
E-m
(9) - = m-x
m
x
x-rn
m
==
m-x
x
x-m
m
=-== = z
m-x
m
x-m
m
=--=
=m-x
x
m
m
m
--x-m
m
=--
(10)
m-m
x
m
-
where a + a ' = 6 + 6'; re(a), re(al) > 0, and p is the beta function. (The
prime is not complementation in this context.) Then obviously
(8.5.2)
(8.5.3)
Cij:=Fi@q
i,j=l,2,3,4
Thus C12= AG, C3, produces Carlson's log (Exercise 3 of Section 8.3), and
C14= A @, G leads to Borchardt's algorithm (Theorem 8.4).
Theorem 8.6 (Carlson)
Let i, j = 1 , 2 , 3 , 4 with i .f j.
(a)
[Fi, q ] converges.
(b)
Cij(a,b)=[R(a;S,S';a2, b2)]- 1/ 2 n
(c)
and
Thus
3,
and
a':=
+ b2)-112dt .
- b2)lI4(t
and gives the arc length of the lemniscate (r2 = cos 28) from the origin to
the point with radial position x. (See also Theorem 1.7.) Thus
Similarly,
IOx+
(1
s4)-112ds
0) = K ( 1 / ~ ) by
, Theorem
8.4 (ARCHIMEDEAN
MEANS)Let us consider H, @, H, for p , q =
rtl, 0. Then the previous exercise and Exercise 5 of Section 8.3 show that it
suffices to consider Hl @,HI, Hl @, H,, and Hl @, H-,. Exercise 5c) of
Section 8.3 gives H, @, H,(a, b) = (a + 2b)/3 while H, @, Ho is Borchardt's
algorithm. It remains to study HI @, H-, . With a := /3 := $ , this is a special
case of the mean iteration a, := a > 0 and b, := b > 0,
EXAMPLE
a,,, := aa,
+ (1 - a)b,
bn+l:=
an+lbn
Pa,+, + (1 - P)bn
260
where 0 < a < 1 and 0 < p < 1. In the notation of Exercise 3 of Section 8.4
this computes G : = Q, @ , ( Q p ) - , for a > b. If we let k n := bnlan, we
derive
k-1
-1=(
nil
a p ) ( k i l - 1) = (ap)"+(: - I)
and
Thus
an+1
- -a
+ (1- a)kn
an
alb)
[ I1--a((aapp))"n((ll--alb)
Many other related algorithms can be found in Wimp [84]. This includes an
extensive discussion of quadratically computable trigonometric integrals (in
Chapter 14).
1. a) Show
that
R ( a ; 6 , s ' ; x2, y2) = y - 2 a ~ ( aS,6
;
6 ' ; 1 - x21y2),
where F is the Gaussian hypergeometric series. (See Exercise 6 in
Section 1.3.)
2
b) Show that R ( a ; 6, 6 ' ; x2, y 2 ) is homogeneous of degree - a in x
and y2. Also R ( a ; S , S 1 ; 1,l)= 1 and R ( a ; 6, S t ; ., -) is continuous.
c) Verify the entries in Table 8.1.
261
d) Show that [Pi, 51, i < j, is linearly convergent for all cases except
i := 1, j := 2. Observe that [F,, F,], which is not Archimedean, has
a convergence rate of 2-" and [F2, F,], which is partially comparable, has oscillatory iterates.
2.
as before.
b) Show that C,,(a, b) :=
a' - bL
2 log (alb)
( a # b)
m)2, and
=
5.
262
a) Show that
ab] = 2F(a, b) .
F[(?)',
b) Using a) (or directly) show that
IAw+
[(a
Let
k : = h + \l(h +.a)(A + b) + \l(h
k1:= \l=
and
+ bc)'.
and deduce that cn(vn) := anlb, satisfies v,,, = v,/2 and b,,,lbn =
va,+l/b,= an+,lbn+,= ~ n ( v ~ / 2 ~ + ' ) .
Recover
Borchardt's algorithm by considering k := 0.
c)
8. Compute a formula for the MacLaurin series for G(1- x, 1) in (8.5.9).
8.6 SERIES EXPANSIONS OF CERTAIN MEANS
For homogeneous means it is particularly easy to compute Taylor series. It
suffices to expand the trace around 1. In the exercises we list various series
taken from Gould and Mays [84], extending results in Lehmer [71].
Similarly,
From these it follows that the only means which are both Holder and
Lehmer means are H-,, A, and G. (See Exercise 5.) Similarly, but slightly
more elaborate, analysis shows that the following theorem holds.
Theorem 8.7
(Lehmer)
HpOgHq= Hs
(a)
if and only if p + q = s = 0.
(b)
Lp O , L q
if and only if p
(4
= Ls
+ q = 2 s = 0,1,2.
Lp O&,= Hs
if and o n l y i f p + q - 1 = s = - 1 , 0 , 1 .
(4
Hp 0, Lq = Ls
if and only if p = -q and s = $.
264
where
2.
for n L 1.
(Euler) If g(x) := C : = , anxnand gP(x) =: C : = , bnxn, then
where
1
D ( p , n):=
4.
a)
nP
N-l
C (-1)
k=o
P
n-k
where
) kp+k-n
n-k+l
D(p7 k)
b) Show that
m
C~
~ ~ , ~ ( 1x), :=
1 -
( sr,,n)xn
n=O
where
=1
and q = 1; or
a) Show that
pn
P I P 2 . .. Pn-,
ao:=l
bn :=
Pn+l+ 2x2"
P l P 2 . - Pn
b0:=1+4x
C,
..-- 2x2"
P I P 2 ..Pn
co := 2x
then
an + bn
an+, = 2
266
Most of the results of this chapter have direct analogues for functions of
more than two variables. Often these follow by similar. arguments. We
concentrate on mean iterations. Let Z := (a,, . . . , a,) be any strictly positive vector.
An N-dimensional mean is any continuous function M such that
( 6 for
) any permutation b of Z.
Homogeneity and isotonicity are defined analogously. The Lehmer and
Holder means are defined by
and
(8.7.4)
respectively.
Given N N-dimensional means M', . . . , MN, we consider the iteration
[M',...,MN]defined b y i & : = i > O a n d
i=l
i-1
..
, a n)
/\El
(8.7.7)
n j m
Proof.
(a) Much as before, a, r an+,r bn+,r b,, and we may suppose a, converges to a and bn converges to b. Let F be any cluster point of {%),
which is bounded. Then b 5 /\
ci 5 V Ni = l ci 5 a, and
El
268
Since all the means are strict, we must have ci = a and ci = b for each i.
Thus a = b, and the iteration converges, say, to &. As before, the limit
is a continuous strict mean.
(b) By symmetry we have MIk(ce)= 1 / N for any i, k and any multiple of
the unit vector e . Thus
i
a,,,
-a=
2 [(a* - a ) + o(a* - a ) ] . ,
k=l
Hence
- ai,+,l= o(a:
-a)
+ ~ ( a i-, a ) = ~ ( a -, b,)
a,,,
- a'+, =
h<k
O c a,,,
- b,+, 5
- b,I2)
8.5 (SCHLOMILCH)
Consider
C,+l:= (a,b,c,)
113
269
,,,,,,
where He(a, c) := (a VE
+ c) 13 = E ,(a, c) is the Heronian mean.
While this does not appear to have a closed form, it has an attractive
product expansion,
where xo := x and
and
:= 1.
3.
Show that S(1, x ) satisfies (8.7.9). Compare this to the product expansion for 2 ( l , x).
4.
a) Let
and
Show that
that M* 2 M*"
and that
&, M i = Ho.
i= 1
c)
PT,)
El
El
for p f 0, 1, 2. Also
(a - b)(b - c)(c - a)
b' c, := 2(b - c) log a 2(c - a) log b + 2(a - b) log c
1"'
while
S1(a' b'
( a - b)(b - c)(a - c)
:=I2(b - c)a log a + 2(c - a) b log b + 2(a - b)c log c
and, using f(x) := log, that the identric mean may be given as
9 ( a , b, c):= S2(a, b, c) := exp[
a2 log a
(a-b)(a-c)
c2 log c
log b
+ ( b -b2
a)(b-c)
c)
2,(a,, a,,
. . . , a,)
:= [(-I)N(N - 1)
c,=,
log a,
(a, - aj>
272
a) Observe that, while the means are not all symmetric, the derivatives
on the diagonal (a = b = c = d ) all coincide, and hence establish
quadratic convergence.
b) Show that when a, := b, and c, := do, the iteration reduces to the
AGM. This iteration shares many of AGM's attributes (Arazy et al.
[85l)
7. (An extended convergence result)
a) Observe that the convergence result Theorem 8.8 continues to hold
if the condition that the means are strict is relaxed to
N
a(q= V ci
i=l
i=l
i=l
ci =
ci .
i=l
Here
for n 2 0. Here *F
represents entrywise square root. Then A,
converges to a constant matrix with entry e(A).
c)
If A :=
( t) then e(A)
( 1 1)then e(A)
= AG(a,
b).
= a(a, b ) .
d) If A :=
e) Similarly consider A, and B, entry positive and
273
f(z>= [f(zln)ln .
where M(x) := M(l, x) and p(x) := N ( l , x)lM(l, x). Thus if both M and N
are algebraic, then L satisfies an algebraic functional relation with algebraic
274
i=O
(8.8.8)
i=l
To see that (a), (b), (c) and (a'), (b'), (c') are the same is equivalent to
Lindemann's theorem which guarantees that exp (a) is transcendental for
any algebraic a # 0. (See Exercise 7 of Section 11.2.)
We first prove part (a'). Suppose that p is an algebraic transformation for
exp. Then there exists an expression of the form
(8.8.9)
i=O
275
where the ai are nonzero algebraic functions in z and the m , and n , are
integers. Equivalently,
k
2 ai exp ( m i z + ap ( z ) ) = 0
i=O
This expression has one less term than (8.8.10) and contradicts the
minimality of (8.8.10) unless (8.8.11) contains no nonzero terms. This
implies that each term of (8.8.10) must be constant. Thus there exist m and
n integral and a nonzero algebraic function a ( z ) so that
(8.8.12)
a ( z ) exp ( m z + n p ( z ) ) = constant.
(8.8.13)
C ai[log ( z ) l m i .[log ( p ( z ) ) l n i= 0
i=O
where the ai are algebraic functions and the m i and ni are nonnegative
integers. Let (8.8.13) be minimal in the sense that it has the smallest
maximal degree and contains the fewest distinct terms of maximal degree.
(The degree of a term is m i + ni.) Suppose the term corresonding to i = 0 is
such a maximal term. If we now divide (8.8.13) by a, and differentiate, we
are left with an expression containing fewer maximal terms, which contradicts minimality unless all the transcendental terms vanish under differentiation. In particular (8.8.13) must actually be of the form
(8.8.14)
276
(8.8.15)
and we see that p is algebraic exactly when exp (dlc) is algebraic and -blc
is rational. (Otherwise, by Exercise 2, z - ~ "is transcendental.) This finishes
(b').
The proof of (c') is again similar to (a'). There are, however, a few
additional wrinkles. Suppose now that p(z) is an algebraic transform for
exp (exp (2)). Consider minimal length sums of the form
(8.8.16)
=0
where the ai are algebraic functions of z, exp (z), and exp (p(z)). Dividing
and differentiating (8.8.16) leads, as in (a'), to a contradiciton unless
(8.8.17)
and the result follows. One of the wrinkles is that we must establish the
transcendence of exp (exp (2)) over the algebraic functions in z, exp (z), and
exp (p(z)). This is necessary for our minimality argument and can be done
analogously to proving that exp is transcendental. (See Exercise 2.) Cl
The transformations we are considering are algebraic over 0. This is a
natural choice for computationally related questions. The analogous results
for algebraic transformations over C are similar but easier because the
number theoretic details vanish. We have the following:
Theorem 8.10
(a) TG(exp: @) = {az + b: a E Q and b E @).
(b) TG(1og: C)= {azb: b E Q and a E C).
(c) TG(exp (exp) : C) = {z + b : exp (b) E Q) .
If f(z) = g(a(z)), where a(z) is an algebraic function, then
277
' and log, (z), where q is an algebraic number distinct from 0 and 1,
TG(qz :Q) = {az
+ b: a and b rational)
1
F
To derive these results we first observe that (a') and (b') of Theorem 8.9
hold for the above functions (with exp replaced by qz and log replaced by
log,). The only difficult part is to show that qa and a are simultaneously
algebraic exactly when a is rational. This is the celebrated GelfondSchneider theorem. (See Section 11.2.)
z) and
In Exercises 1 and 5 we show that functions like /3(eY("),
/.?(log y(z), z ) , where P(., .) and y(.) are algebraic, cannot support functional equations that possess quadratic fixed points in their domain of analyticity. In particular such functions cannot be the limit of quadratically convergent homogeneous mean iterations. This shows why the elementary
functions are always associated with linearly convergent iterations.
Comments and Exercises
1. a)
Suppose that
278
c) Show that none of exp, log, or exp (exp) nor any function algebraically conjugate to the above functions is the limit of a quadratically
converging algebraic homogeneous mean iteration.
2.
a) Prove that za ( a irrational), log, and sin are transcendental functions over C(z).
b) Show that exp (exp) is transcendental over C(z, exp (z), exp (p(z)))
when p is algebraic.
j=O
279
1C s,(x)[f(l, x)ll = 0,
si
Eq
x)
and
(8.8.22)
2 'i(x7
Y)[~(x,y)I1= 0
C ri(M7N)[f(M, N)]' = 0
while by inyariance,
Observe that
b)' Let
and
c)
7.
Calculate
sin-' :C)
and
~ ~ ( c o s:-C)' .
Hint:
sin-' z = - i log (iz +
m)
.
Chapter Nine
9.1
1r
5
t
,'
Lemma 9.1
282
Proposition 9.1
(9.1.3)
(9.1.5)
T := T ( q , 6 ) :=
cot
(:)
+C
un sin (no)
n=l
1 "
n u n [ l - cos (no)].
T2:= T 2 ( q ,0 ) = 2 n=l
Then
Tl + T , = T ~ .
Proof.
[f cot (;)12+s1+ S2
T'=
where
1 "
S , := un cot
2 .=I
(:)
sin (no)
and
S2 :=
m,n=l
Now
1
- cot
2
(:)
1
1
sin (no) = - + C cos ( k o ) + - cos (no)
2 k=l
2
while
2 sin ( m 0 ) sin (no) = cos [ ( m- n)8]- cos [(m+ n)6] .
Thus
T'
[:
cot
(:)I
+C
k=O
at cos (kB)
a, = - 2 u n ( l+ u,,)= - C nu,
2 .=I
2 n=l
by (9.11). For k 2 1, S, contributes to a,,
and S, donates
m-n=k
... ..
...
n-m=k
..
'
m+n=k
Luckily,
and
Hence
) cos ( k 6 ) + 5
C kuk[l - cos ( k d ) ]
k=l
284
Now (3.2.23) can be used to show that the right-hand side is 8:(q). Thus
and this last expression is nonnegative for real q. (Hardy and Wright [60] use
Proposition 9.1 to deduce the formula for 0: from that for 8;).
For r = 1 or 3 let dr(k) denote the number of divisors of k congruent to r
modulo 4. Then
we also have
This is the formula exploited in Section 3.7. Also (9.1.9) shows that
(9.1.12)
8;(q) = 1 4
c
m
(-1)'n-'''2qnm
n,m=l
n odd
and
Now apply the triple product in form (3.1.1) to each of these sums. This
leads to
n (1
m
(9.1.14)
(a - a w l )
- a2qn)(l- a-2qn)(l-
qn)
n=l
286
where the last equality follows from (3.1.4) and (3.1.7). Thus
n odd
where ~ , ( n )is 1 if n = 8 k
+ 1, 8 k + 3 and -1 if n = 8 k + 5 , 8 k + 7 ,
1- q4"
%(~)0~(q~2
) = 4( - - z i ) q n = 4
n=1
1- q
n odd
n=l
n odd
qn + q-"
1 qZn+ q-2n
'
Hence
-6
1) I2 and Fn and Ln are the Fibonacci and Lucas
where p := (1
numbers, as in Section 3.7.
6. ,Use Ramanujan's modular identity of order 3 (Section 5.2) to deduce
that
288
There are some nontrivial considerations about the sense in which these
sums converge. (See Exercises 1 and 2.) We assume all sums denote limits of
rectangular summations. These will converge for re(s) > 0. The general form
of these rectangular sums is
289
Thus
290
and so
In particular,,
291
2C
j=l
(-l)Lmii(ml,m,, . . . , m,)
m.=O
C)
for all partial derivatives with i, < i2 < i, < - . < .,i
Verify that C; (-l)'+jf k(i2+ j2 + k2)-p and E; (-1)'+'(2i
converge for p > 0.
4.
a)
Show that
m
and
+ j)-P
292
b)
Show that
b8(2s) = -16C(s)a(s
- 3)
re (s) > 1
c)
h2(2s) :=
4(n, m)
re(s) > 0
-- [(n + m12)' + 3(rn12)~]~
where q(n, m) := ${sin [(n + 1)0] sin [(m + l)0] - sin (no) sin [(m 1)0]) and 0 := 27~13.This corresponds to calculating the Coulomb
potential on a regular hexagonal lattice. (See Borwein, Borwein, and
Taylor [85].)
a) For re(s) > 1, show using the cubic modular equation that
c)
L-,(s) := 1 - 2-"
4-"
- 5-
+ 7-"
8-
+ .. . .
293
and
which. are primitive L series modulo d. (These can only exist for d = P,
4P, or 8P, and the sign configuration is in fact uniquely specified.) Here
(kin) is the Kronecker (generalized Legendre) symbol. L,, is taken for
p = f1 (mod 4). For example, with P := 29, (9.2.8) becomes
.
i)
ii)
2'(-l)m+n(mz+ pn2)"
xt (-l)"'(mz
+ 2pn2)-"
( e )
and
?i-
-log f 4
fi
( m ) = L14,(1) log 2
m.
1 d-l
h(d) log ~ ( d =) - 5 2 (d / n) log (sin
n=l
(7)).
Moreover, for disjoint forms, one can observe that the class
number h(-d) must coincide with the number of genera g. The
number of genera is as follows. If d is odd, then g = 2"-', where d
has m distinct prime factors. If d is even and dl4 has m distinct
prime factors (including 2), then g = 2" when dl4 = 0 , 1 , 5 (mod 8)
and g = 2"-l otherwise.
An excellent brief survey of history and of recent advances
regarding the class number can be found in Goldfeld [85]. Now
observe that, for the appropriate P, both $ f 4 and $ f;' will be
products of powers of fundamental units from some of the divisors
of 4 P or 8P. [For our solvable P, h(-4P) = 2' when P = 1 (mod 4)
and h(-8P) = 2'.] In particular, verify that for P := 5, 13, or 37,
since t = 1 in each case
P := 21,33,57,93,
e)
f)
296
7.
-- T log -
V D
(1;)
We already know [Exercise 6c)l the first sum on the right. From some
elementary, hut skillful, theta transformations we may deduce that
a beautiful simplification.
a) Use (9.2.10) to compute h"(2P) for P := 3, 5, 11, 29.
b) Use (9.2.10) to compute h"(210) given in (4.6.12).
c) Observe, as Zucker did, that there are computable in this form
two larger singular values: those for 2 P := 330 and 462. Verify that
and that
8.
297
and
4 log ?,I(-)
-.=,c
1
h(-4r)
4r-1
(-4r1~(2)
Show that
b) Correspondingly
One may verify G,,,, and G,,,, from the corresponding cubic and
quintic equations, and so on.
c) Use the techniques of Section 5.3 to compute ~ ( 1 0 5 ) .
d) Observe that
i)
A*(6)=(2-fi)(fi-fi)
A*($)= A*'(%)=(2 - v 3 ) ( f i + fl)
ii)
iii)
q2
.
e) Indeed, for all P of type two, A*(2Q) and A*(2P) will be "conjugate", as we illustrate for A*(210). Let
300
f)
With these conjugate values and the formulae of Section 5.3 one may
observe that a(2P) is now available in closed form for all P of type two (and
for conjugate divisors Q). Again we illustrate with an exercise.
10. a) Generate a recursive version of formula (5.3.3) and specialize this
expression to produce a formula for cu(2dld2) in terms of R d l , Rd2,
Md2, M2 and the appropriate singular values.
b) Apply this formula with dl := 35 and d 2 := 3 to compute 4210).
Observe that this uses only values of A" given in Exercise 9e).
( /213
f i is incorCompute
a(42). Note: g:2 = (21/2 + ~ ) ' [fi)
c)
rectly, given in Weber [08].
a) Show that
if and only if
b) Show that
if and only if
and
m
ii)
C e(n)n-"=-. l(s)
n=l
Also
m,n=t
(9.3.1)
Proof. By symmetry,
odd
and since
302
r($)= -12,
and
odd
Thus
(9.3.3)
- b,(2)
= 16a2
5 sech2(:.\/m)
= 4 log 2
i , .i .. k = l
odd
and
m
- b2(0) = 277
which coincides with Exercise 7eii) of Section 3.7. (See Exercise 2.) For the
final evaluation it helps to know that for primitive L series
(3
(9
L+,(1 - s )
Show that
7-"
+ ..
. . . , a,).
Thus
and
3.
a) Show that
a) Show that
and
b)
iii)
rn,n=~
(m + n)S
= l ( s - 1) - l(s).
Thus
m+n+l
iv)
V)
c)
(-1)
= 4 4 s - 1)
-,
(Iml + 14)"
-gt
--
Q1
(Iml + lnl)"
= 4 3 - 1).
Show that
N
( N - I)!P,(s)
C a r l ( s - 1+ n)
n=l
- 3) - S(S - 2)
+ a {(s - 1) - J(S)
and
P&) = & [(s
- 4) -
- 1)
+ l(s) .
b)
Show that
c)
Show that
2
n,m=l
(-l)"+"
1
= - [(2 + 3-"a(s)
(3n + m)" 3
- a(s
- 1) - L-,(s, I)]
[-
1 7
log 2 - -and so L-,,-,(I, 4) = 3 3
6.
Show that
. (1 + x
~ ~=-(1~- xZN)
) /(I
- x).
306
9.4
valid for all complex z and q with I ql< 1 and z f 0. The proof is left as an
exercise. (See Exercise 1.)
If we divide both sides by 1- z-' and let z tend to 1, we derive
gonal identity.
2.
(n2+n)/2
. With f(q) := IIr=, (1 - qn)-l this bewhere g(q) := CF=, q
comes a formula due to Ramanujan
308
so that
Observe that this is slightly faster to compute than the original theta
series ratio.
e) Show that
so that
5. a) Show that
309
k=O
'
c)
9.5.
valid for 1x1 < 1 and Iql < 1. Verify that Cauchy's binomial theorem
is a special case. Hint: Begin by setting a := q-2n.
Use the q-binomial theorem to express the limit mean of Example
8.4 as a Taylor series.
QUINTIC AND SEPTIC MULTIPLIERS AND ITEZUTZONS
(4
(9.5.1)
5M, = 1+ ~G,,,/G;
(9.5.2)
1/M5 = 1+ ~ G , / G &
(9.5.3)
1 - 212 = M;(I
-llt -t
2 ) d m
Proof.
(a) These are given in Entry 13. Berndt [Pr] provides proofs which may
also be deduced from (4.1.20).
(b) These may similarly be found in Entry 14.
310
where for i = 1 or 2
and
E,
we obtain a
Proposition 9.3
Proof.
We begin with E,, as given by (5.2.13), and the explicit formula for R,.
We use Proposition 9.2(a) to rewrite s(r)R5 and 9.2(b) for the terms
involving 1' and k2. This leads reasonably directly to
311
qiT&Fij>.
Proposition 9.4
4 9 M 2 = 1- + - -I'- - 8 11'
k
k t kk'
1/
k
= 2-
(3213
.
k' - kk' - 8 +-
1'
11'
where wP(l2,k 2 )= 0.
Theorem 9.3
(4
(9.5.17)
312
Proof.
( a ) In each case one combines (9.5.16) with appropriate multiplier equations. For p := 5 use (9.5.1) and (9.5.2). For p := 3 use the identities
preceding equation (4.7.9). (The details are left as Exercise 6a).)
( b ) We use (9.5.11) and (9.5.12) to write
l l ' ~ , ~ ( l 8+ ~ ; =) kl'
+ Ik' - kk'
and
k k ' ~ ; ( 4 9+ 8 1 ~ ;=) k l r + Ik'
- 11'
(9.5.21)
+ Y = 11 + ( 5 ~ , ( q ) ) ~ .
(9.5.22)
where p ,
+ w1I7- 1 ) ~ / ( 7 N , ( q ) )
+ 1 =0
313
Knowing that a solution exists and exhibiting it, particularly in simple form,
can be very different matters. The components of the quintic and septic
algorithms for T are far less complicated than one might initially expect.
Both can be packaged very elegantly. (See Exercises 2 and 7, and Borwein
and Borwein [Pr].)
Combine (9.5.1) and (9.5.2) to obtain Schlafli's form of the quintic
modular equation.
Compute that
a)
(5 2
314
3.
c) Verify 4 7 ) .
d) Establish that R, and R, are as given in Table 5.1.
4.
a) Thus show
Then
8 l I 4 ( f i+ 1)
8 - l I 4 ( f i + 1)
and G:,, =
2
2
*
1
k,)= - .
c)
Show that
+ 8(11'kk')"3{(lk)1'3 + (l'k')1'3) = 1 .
b) Similarly
Chapter Ten
Abstract.
and let
These quantities are, respectively, the error in best uniform polynomial and
317
best uniform rational approximations. The existence of the best approximants is fairly straightforward and is left as Exercise 2.
The most commonly used polynomial approximations to exp are undoubtedly the partial sums of the Taylor series. This is reasonable since in any
neighbourhood of zero the partial sums are asymptotically optimal. (See
also Exercise 3.)
1
Theorem 10.1
(b)
IfpEPn,then
Proof.
"
i=n+l
1
1
~<-[l+n+2(~+;;;?
( a + l)!
+ ( n + 3 )1( n + 4 ) + .
.)I .
where s n is the nth partial sum to exp at zero, then by RouchC's theorem,
p(z) - sn(z)
and
ez - s,(z)
have the same number of zeros (counting multiplicity) in Dl. As ez sn(z) has a zero of order n + 1 at zero, we deduce the contradiction that
p = s,. To finish the proof we need only observe that
1
min lez - sn(z)l> lzl=l
(n + I)!
318
Theorem 10.2
Proof. Let
n
(101.6)
qn(z) := (2n)!
(:)(-Zlv
"=O
Then
Since
(10.1.9)
= JOw (t - z)"tne-' dt
(($)!
and
PI^ - rn,,
and
ez - r,,,
would both have the same number of zeros, and from (10.1.7) we would
deduce that p l q - r,,, has at least 2n + 1 zeros and hence is identically
zero. 0
The fact that r,,, is the Pad6 approximant is a consequence of (10.1.7),
which shows that
320
log
z+1
(=)
=2 ( t
1
+ S;i
+9
l + . - .)
z"-.l
lim -= log z
SJO
using PadC approximants centered at z := 1/(2n + 1). This is due to Trefethen [84]. (See Exercise 10.) The discussion of the approximations of exp
follows Newman [79], as do Exercises 7 and 8. These exercises illustrate the
different rates of convergence on disks and intervals. Exercise 8 is the n = m
case of a conjecture of Meinardus, namely, that
p(zi) = W ,
i = 0,.
. . ,n .
Show that
where
2.
3.
a)
lim sup
n-'m
Ilf-snllD,
4) - 1
/If-snIID,
8.
1.
Hint: Proceed as in 7). First observe that Exercise 7a) holds for
rational functions of degree n. The approximation is given by
Show that
zo
rn
~ r n , n ( :=
~)
m!(n + m - k)!
(m - k)!k! z
zo
J," tne-' dt = n!
P m ,n
:= Pm,n
n!
and
Qm,n
..-
qm,n
m!
325
b)
+ . + t,xn, then
Show that
du
326
Thus
Show that
e) Show that
f)
327
than those for log. (See Exercise 1.) We proceed to examine three methods
of complexity reduction. While none of these methods is as fast as those of
Chapter 7, they all have their own particular advantages. The second
method, based on the fast Fourier transform, for example, applies to most
of the special functions.
10.2.1 Acceleration Based on Functional Equations
and
where s, and r,,, are, respectively, the nth Taylor polynomial and the (n, n)
Pad6 approximant to exp at zero. Both of these above estimates provide
O
and
OB(v'EM(n))
(10.2.4)
and
OB(v'EM(n)).
These are good intermediate range estimates of exp and log. For
328
and consider
We can write
n-1
where
(10.2.10)
O,,(nl~'(log n)')
and
O,(nl"(log n ) ' ~ ( n ) )
~ , , ( n ' / ~ ( l n)')
o~
and
~ , ( n " ~ ( l n)'M(n))
o~
algorithm for log. This merely requires truncating (10.2.5), say, after n2
terms and evaluating the truncation using the FFT method in O,,(n(log n)')
steps. Note that this approximation provides 0(n3) digits of log.
Likewise algorithms can be constructed for exp and the trigonometric
functions with complexity given by (10.2.11).
The relative difficulty of implementing these algorithms renders them
largely of theoretical interest.
10.2.3 Acceleration Based on Binary Splitting
If we wish to evaluate the constant e by summing the Taylor series at 1, then
we can and should take advantage of the reduced length of each individual
operation. With this in mind consider
and
where
By construction,
330
a)
(y-z 2 z+ 1 + 1
= arctan
(;)I2
1 ~ 1 %q 5 1.
cos z + 1
c)
and
~,(n"~~(n))
4.
where
tN(rn) := (Nm)
XNm !
X
+
xN- 1
x2
+ (Nm + l)(Nm
+ 2) +..
+ (Nm + 1) . . (N(m + 1) - 1)
332
c)
Construct an ~ ~ , ( n ~ ~n)')
' ( l algorithm
o~
for the Bessel function of
order zero,
6. (An O,(n"'(log
T(s) = N
(-l)k N~
+
2- 7
k=O
IN
m
e-'ts-l dt.
c)
0,(n1"(1og n ) ' ~ ( n ) ).
7. ( ~ , ~ ( n ~ ~ n)')
' ( l oalgorithms
~
for hypergeornetric functions) We now
consider a (general) hypergeometric function to be a function
and
Observe that
d) Show that
V(y) :=
fl R(i + y)
i=O
e)
Q((n
- l>n>
f)
~ , ( n " ~ ( l no)~' ~ ( n ) )
whereqk:=2
d) Write
2k
2k-1
andOspk<2
T)
336
+ -+
. ..
Show that
(10.2.23)
-El(x)
=y
x>o.
+ log x + kC= 1 k . k!
Chapter Eleven
Abstract. The first section of the chapter deals with the history of the
calculation of n and related matters, while the second section deals with its
transcendence. The third section looks at irrationality measures and includes a
proof of the irrationality of c(3). This chapter is largely self-contained and
indeed contains considerable related number theory, especially in the exercises.
11.1
The history of .rr presumably begins with man's first attempts at estimating
the perimeter or area of a circle of given radius and as such starts at the
dawn of recorded history. The Egyptian Rhind (or Ahmes) Papyrus which
dates from approximately 2000 B.c., gives a value of (1619)' = 3.1604. . . for
T. Various other early Babylonian and Egyptian estimates include 3, 3 $ ,
and 33. Implicit in the Bible (1 Kings 7: 23) is a value 3: "And he made a
molten sea, ten cubits from the one brim to the'other; it was round all
about. . . and a line of thirty cubits did compass it round about."
Mathematical interest in n comes into sharp focus in the classical Greek
period. The Greeks investigated the problem of "squaring the circle.'' This
question and its final resolution over two millenia later will be pursued in
the next section. Currently we wish to review the primary Western developments in the calculation of n.
Archimedes of Syracuse (287-212 B.c.)provided the first major landmark
in the quest for digits of n. By considering inscribed and circumscribed
polygons of 96 sides, Archimedes gave the estimate
This appears in his Arithmetica In$nitorum.of 1655. A few years later Lord
Brouncker (1620-1684), the first president of the Royal Society, recast this
as the continued fraction.
m,
which is essentially an arcsin expansion. (See Exercise 4.) Newton was later
to write: "I am ashamed to tell you to how many figures I carried these
computations, having no other business at the time." John Machin (16801752) derived the formula which bears his name:
rr
- = 4 arctan
4
Coupled with Gregory's series for arctan this provides a very attractive
method for calculating rr since the first term is well suited to decimal
arithmetic and the second term converges very rapidly. Machin calculated
100 digits this way in 1706. In the same year William Jones published his A
and
.=I
The explicit summation of (11.1.12) had eluded Liebniz and also the
Bernoulli brothers, Jacques and Jean. The method by which Euler derived
his evaluations of C;=, 1lnZkis outlined in Exercise 7. This is to be found in
Euler's Introducio in Analysin Infinitorurn of 1748. The Machin-like formula
(11.1.14)
'IT
= 20 arctan
'IT
- = arctan
4
(i)+
arctan
Thus dawns the computer age. In June 1949 ENIAC (Electronic Numerical Integrator and Computer) was used to evaluate 2037 digits of rr using
Machin's formula and 70 hours elapsed time. An analysis of the distribution
of the digits was carried out by Metropolis, Reitwiesner, and von Neumann.
By 1958, Genuys had computed 10,000 digits on an IBM 704 in 100 minutes,
once again using Machin's formula. Felton had performed a 10,000-digit
calculation in 1957; however, due to machine error it was only correct to
7480 digits. In 1961 D. Shanks and Wrench [62] used the identity
(11.1.18)
~=24arctan(!j)+8arctan(-&)+4arctan(&)
and under 9 hours on an IBM 7090 to produce 100,000 digits of T.This was
checked using the formula
1
(11.1.19) T = 48 arctan
+ 32 arctan
- 20 arctan (%) .
(&)
($)
The million-digit mark was set by Guilloud and Bouyer in 1973 on a CDC
7600. The calculation, which took just under a day, used (11.1.19) with
(11.1.18) as a check.
. Kanada, Tamura, Yoshino, and Ushiro [Pr] calculated in excess of 16
million digits using an AGM based algorithm, Algorithm 2.2, and checked
10 million digits using (11.1.19). The 16 million-digit calculation took under
30 hours on a HITAC M-280H and used an FFT-based fast multiplication.
At the end of 1985 the record belonged to W. Gosper. He calculated 17
million terms of the continued fraction expansion for T and so in excess of
this number of decimal digits-after a radix conversion from a binary
computation. His method is based on a very careful evaluation of Ramanujan7s series (5.5.23) on a Symbolics 3670. (A remarkable feat considering
the size of the machine.) As is surprisingly often the case with these large
scale calculations, Gosper uncovered subtle design flaws which had not
surfaced in smaller calculations.
In January 1986, D. H. Bailey [88] computed 29,360,000 decimal digits of
T on the CRAY-2 at the NASA Ames Research Center. This calculation
used only 12 steps of the quartic algorithm (5.4.7) with r := 4. This results
in computing a(z5O), which agrees with T-' to more than 45 million places.
The calculation took less than 28 hours and was verified with a 40-hour
computation of 25 steps of Algorithm 2.1. It is amusing to observe that the
quartic calculation requires well under 100 full precision multiplications,
divisions, and root extractions.
In July 1986, Kanada reclaimed the record with a computation of 2''
decimal digits. He again used Algorithm 2.2, verified in September using
(5.4.7) with r := 4, but reduced the elapsed time to 5 hours and 56 minutes
on a S-810120 super computer. This represented a speed-up by a factor of
15. His previous computation now used only 96 minutes of CPU time. Plans
were to compute 227 decimal digits (over 100 million) at the end of 1987.
Nor is the end in sight. It will probably be the case than hundreds or
thousands of millions of digits will be calculated by the end of the century.
(This is now more a matter of will than anything else. Indeed, by the time of
second printing Kanada had similarly computed 201 million digits.) Apart
from observations like "the sequence 314159 appears in the digits of n
commencing at digit 9,973,760," there is little we care to say about the
digits. They have, however, been subjected to considerable scrutiny. It is an
open question as to whether a is normal. That is, do all sequences of
integers appear with the same frequency in the digits (are one-tenth of the
digits 7, one-hundredth of the consecutive digit pairs 23, etc.)? On the basis
of the first 30 million digits the answer appears positive. This, of course, is
no great help in deciding the normality issue. (See Wagon [85].)
In terms of utility, even far-fetched applications such as measuring the
circumference of the universe require no more digits than Ludolph van
Ceulen had available-but then utility has had little to do with this particular story.
Comments and Exercises
This section presents only the highlights of the quest for digits. The matter
may be pursued in detaiI in Beckmann [77], a most useful though rather
individualistic history, and in Le Petit Archimdde [80]. Schepler's chronography [50] and Wrench's history [60] are also of interest. Details of the more
recent calculations may be found in Tamura and Kanada [Pr], where a
compendium of Machin-like identities is provided.
There is also a considerable collection of T-related trivia. For example,
the Indiana House of Representatives attempted to legislate the value of .rr
in Bill 246 of 1897. The bill, which appears to proclaim n to be several
different incredibly inaccurate values, including 4 and 64/25 (see Beckmann
[77], and Singmaster [85]), passed the House and only floundered in the
Senate on the apparently chance intercession of C . A. Waldo, a professor at
Purdue. Keith [86] gives a 402 digit mnemonic for T.
Show that the algorithm of (11.1.1) and (11.1.2) calculates n by
showing that a, and b, are as advertised.
Prove (11.1.3) and estimate how many iterations of (11.1.1) and
(11.1.2) are required to calculate 35 digits of T . This should be
compared to Bailey's [88] calculation which uses the same operations.
Prove, from the product expansions for sin and cos, that
sin 8
(11.1.20) 8 =
($1 < a .
cos ($12) cos ($12') cos ($12~).Alternatively deduce (11.1.20) in an elementary fashion by setting
In:=cos
($1 . - . (t)
(;)
cos
In=
sin 0
2" sin (012")
cos
'
3.
m to
and
4.
and that
344
Pi
5.
- = 4 arctan
4
(i)
- arctan
(&)
2 tan 0 - -5
1- tan2 0 12
and
Hence
tan 40 - - =
-1+tan40 - -1
1 tan48
239
'
arctan
(i)
= arctan
(-)p +1q
(-)
X Y < ~ .
+ arctan (
arctan
(&)
+ arctan (&)arctan (i)
.
=
(Machin-like formulae)
a) Show, for integral ai and bj, that
k v = arctan
345
~r
(-1.
():
+ n arctan
- = 4 arctan
-=
5-
-=
-=
(i)
+ i)"
- arctan
kT
is real.
(A)
(i)
(f) + arctan
2 arctan ( f) arctan (f )
2 arctan (i)+ arctan (f )
arctan
(t) 7
(Machin, 1706)
(Euler, 1738)
(Hermann, 1706)
(Hutton, 1776) .
These are, in fact, all the nontrivial solutions of b). This was a
problem of Grave's solved by Stprrmer in 1897. The problem can
be reduced to finding integral solutions of 1+ x2 = 2yn or 1+ x2 =
yn, n r 3, n odd. (See Ribenboim [84].) Much related material on
Machin-like formulae occurs in Lehmer [38] and Todd [49].
7.
Hint: If
s := a,
then
+ a, + a,a2 + a,a2a3 + - .
so that
and
X
arctan x =
x2
1+
3-x2+
9x2
25x2
5-3x2+ 7-5X2...
'
and
21 - 1
( k ? ~ ) 90
~
k=l
Use the product expansion for sin [Section 2.2, Exercise Id)] to make
the above argument of Euler's rigorous. (See also Exercise 14 of
Section 11.3.)
9. In computing r from (11.1.19) one must evaluate arctan ( & ) and
arctan ( $ ). Use (11.1.15) to observe that
I
L
k
347
Thus terms of the second series are just decimal shifts of terms of the
first series. (See Ballantine [39].) How does this affect the'complexity
of calculating the two arctans?
10. Prove that the number
0.12345678910111213~. . n(n + 1)
- ..
?r
349
that any nonvanishing linear combination of logarithms of algebraic numbers with algebraic coefficients is transcendental. That is, if ari and Pi are
nonzero algebraic numbers, then Po + Xi"=, a,. log Pj # 0. (See Baker [75].)
Note that ii = e-"I2, and so transcendence of e" follows from the GelfondSchneider theorem.
;
angle)
d) Show that the constructible numbers form a field. See the hint
provided in the illustration. Hint:
+ 11,
a : =10 z ~and
1
,,=I
p:=xn=1
lo2"'
x"(1- x)"
n!
has the property that f and all its derivatives are integer valued at
x = 0 and x = 1 and that for x E (0, I),
is an integer but that the left-hand side above lies strictly between 0
and 1 for large n.
d) Deduce from c) that eq is irrational for all rational q #O.
e) Show that rr2 (and hence a ) is irrational.
4"Outline: Suppose .rr2 = alb with a, b E N . Let f be as in b) and
consider
Then
d
dx
and
rr
jolanf(x) sin ( m ) dx
= G(O)
+ G(1)
let
and let
Observe that S + T = 0.
c) Show that, for infinitely many m, Slm! is a nonzero integer. To do
this, observe that if y := x - k, then
n)l(x+ 1)
whence
(On the transcendence of r r ) The exercise outlines Baker's [75] synthesis of Hilbert's proof of the transcendence of n.
355
?r
Show that expanding the above yields a nonzero integer n such that
where
while for j = p - 1,
f "-"(0)
- - . a,)'.
= ( p - I)!(- l)nP(ala;
and
p !'4 f('-')
(0)-
Pi
356
Show that
Hence with d)
and
7.
357
Lindemann's Theorem
If a,, . . . , a, are distinct algebraic numbers and P,, . . . , P, are nonzero
algebraic numbers, then
a) Suppose that
For 1 5 i 5 n, let
where
Show that
( P-
1 f :j)(ak)
. . . ,1&
are
j=p-1,
k=i
while
!
1 f(
a )
otherwise
IJ,.
I J,I
of Q. Show that
b)
Show that
i)
A P Q ) = p*.(P)AQ)
ii)
p ( P ) 5 length (P)
iii) p ( x - c ) = p ( x - I c I )
iv) p((x - cn)"") = p(x - c)
v> p(x - c) = m M l , IcI)
. Then
Pi
360
and
e'l+f2+"'+f,,
dt, . . . dt, .
c) Thus
where
and where
Hint: Use the residue theorem to see that V,,, is of the right form.
361
?r
v,,,
= O(x
(n+l)(m+l)-1
1.
Note that
for x#O.
akjxiekx
Osksn
Osjsm
where the akj are integers. Use c) and the fact that d, < en('+") for
large n (Exercise 6 of Section 11.3) to show that for m + n large,
and
has infinitely many integer solutions in p and q for almost all a exactly when
usually has only finitely many solutions. This result is due to Khinchin [64].
(See also Exercise 2 of Section 11.2 and Exercise 1 of this section.)
It is standard to the theory of continued fractions that if
k=0
(11.3.8)
tn :=
(2n - k)!
(n - k)!k!
( n - k)!k!
then
s,
Ie-<l=-
1 log log tn
2 t: log tn [ I + O ( ~ ) I
and
(11.3.11)
The result now follows, as an application of Stirling's formula gives
2n+1=
2 log t,
[ l + o(l)]. 0
log log t,
Theorem 11.2
If p , q E Z and
. . .].
This, with Theorem 11.1, shows that p,/q, are the convergents for the
simple continued fraction for (e - 1)/2 and that expansion (11.3.13) holds.
(See Exercise 2.) In particular, if
The above corollary due to Davis [79] shows that el'" is atypical with
respect to the rate of rational approximation [compare (11.3.4) and
(11.3.5)]. For v = 1, Corollary 11.1 can be deduced from Theorem 11.1
without reference to Theorem 11.2 on continued fractions. (See Exercise 4.)
For general v the proof is left as Exercise 5.
We now turn to irrationality measures for T , l ( 2 ) , and 5(3). The
approach follows Beuker's [79] elegant treatment of ApCry's startling proof
of the irrationality of J(3).
Lemma 11.1
Let r and s be nonnegative integers.
for some n in E
r>s
1
tb)
Id Id
- xrys log xy
- xy
dx dy
r=s>O
for some n in
Proof. Consider
Pi
366
Theorem 11.3
1
n!
p,(x) := -
(11.3.22)
k=O
()
(n + k)!
n!k! ( - l ) k ~ k
I" .* --
lo1
16
( 1 - Y )pn("1 dX dY
1 - xy
(11.3.24)
(fi
- 1)'/2] we have, by (11.3.24) and Lemma
367
On the other hand by the same lemma applied to the first form of I,,
where
~=&(n)(;)
(n + k)!
! !
where c, > 0, c,, c,, and c, are constants. In fact, van der Poorten [79] gives
6 :=
Theorem 11.4
For p and q integers and q sufficiently large,
and
.> O .
for sufficiently large q, then plq = anlyn for some n. [Here a,, y,, and
6 := 0.092. . . are as in (11.3.31).] Thus we need only verify (11.3.33) for
p l q = an/yn.This follows from the observation that, for small q > 0 and for
large n,
One now finishes the result by using estimates (11.3.29) and (11.3.30) to
show that
[
(- 1 1 2 - ]
2->3
Yn
1
11.86
Yn
The first irrationality measure for .rr was due to Mahler [53] (see also
Mahler [67]), who showed that
1
(which can be marginally sharpened).
q large
Theorem 11.5
[(3) is irrational.
Proof. Consider
where, as in (11.3.22),
We observe that
where the last equality follows from an n-fold integration by parts with
respect to y. For 0 5 x, y, w 5 1,
370
Proof. The proof is similar to the proof of Theorem 11.3 and 11.4. One
first calculates explicitly from (11.3.35) and (ll.3.23),
c,nc2(1+ 1/2)4n5
(11.3.42)
+ 16)~"
I
:c3nc4(l
The remainder of the proof follows much as in Theorem 11.4, and the
details are left as Exercise 8.
Similar considerations lead to the irrationality measures for log.
Theorem 11.6
Let p, q , and n be integers. Then
(11.3.44)
and for any
I b P ~>p
I
E
q large
q large .
371
behaves like 3 log nl(log log n) (rather than the expected c log n that holds
for almost all numbers).
Transcendence estimates of type
>
1
qc log log
c a constant.
has infinitely many solutions has (Lebesque) measure zero. (The other
half of Khinchin's theorem is more delicate.)
Hint: Fix N. For each q > N consider intervals of radius f(q) /q around
the q points O/q, 1/q, . . . , ( q - l ) / q . The measure of the union of all
these intervals is bounded by
2.
a1 +
a2 +
an
and let
[a,, a,, . . .I := f,i,mm
[a,, a,, . . . , a,].
Deduce that pn and q, are relatively prime. Note that {p,) and
{ qn) are increasing sequences.
b) Show that continued fractions are well defined, that is, show that
the limit in the definition always exists.
Rational
numbers have two representations since
c)
[a,,
and
. . . , a,]
= [a,,
. . . , an-, + 11
,-,,
a, = 1
d) Let
4 := [a,,
373
Thus {~z,klz,) T a and ~z,+,lq2,+1) ae) One constructs the simple continued fraction to a as follows
(1 1 := integer part). Let
a,:=La]
and
a,:=&
. . .]
Hint: Prove I P, - qna 1 < - q a 1, Show first that one may asp:=up,+
sume 9,-,<q<q,.
Now write q : = u q , + v q
vpn-1, and ~ n - q n a = u ( ~ n - q n a ) + v ( ~ n - l - q n - i a ) .
Show
that u and v are nonzero integers and that u(p, - q,a) and
v(pn-, - qn-Ia ) have the same sign.
h) Show that if for z > 1,
and
,-,,
then
(Y
= [ a o , . . . , a n , z]
and
z = [ a n + l , a n + , 7- . . I .
[There are two representations for p l q [part c)]. Choose the one
for which n satisfies ps - qr = (-I)"-'.]
i) Show that if
then p l q is a convergent of a.
Hint: Suppose
and
Write
Fl:=F,:=l.
Fn+,:= Fn+ Fn-,
Show that {F,+,IF,) are the convergents to (1 + f i ) / 2 . Show
that
1) Show that
= [a, 2a, 2a, 2a,
. . .].
Pi
and
d a positive integer
a.
and
Then either
=
4
or for
4,
for some n
Hint: Let l a - p l q l = l l q
$q:+*':q < $q;. Then
1+1/6+&'
E'
4.
and
a) Show that
and that
Hence
Pi
Hint:
n I
(r2. :,=
pi)
(2n) !
and
(2n)! 14"
n!n!
where the product is taken over the primes pi, n < p i 5 2n. Thus
b) Let ~ ( n denote
)
the number of primes less than or equal to n. The
prime number theorem asserts that
n
~(n)log n
(See, for example, Hardy and Wright [60].) Use this to prove that
.dn= O(esn)
Hint:
d,
npfi
5 nT(n)
where the product is taken over the primes 5 n and where ai is the
largest integer so that pfi 5 n.
7.
a) Show that
,=e,
b) Show that
Show that
(x
+ a,)
and
(This exercise follows van der Poorten [79] and is essentially due
to Apkry.)
8. Finish the proof of Corollary 11.2.
9. (Irrationality measures for log)
a) Let p,(x) := {dldx)'xn(l
- x)".
(i )( n
k , rnk
behaves like
Pi
ffn
=-
dn
+ p,(-1)
log 2
Estimate / I / from above and below by computing that the maximum of x ( 1 - x ) l ( l + x ) occurs at x : = a - - 1 . For large n,
c)
In :=
1+ xlm
Show that B,(q) and B,(q) are irrational for rational q := 1 In,
n = 1 , 2 , 3, . . . .
b) (Euler) Show that m4 n4 = p 2 with m, n, p integral implies
mn = 0.
C) Use b) to, show that B2, B3, and 8, are never all nonzero and
rational.
10. a)
Outline: Let
and
Show that
and that
c)
Pi
382
ii) ~
3
4
i , ( =~-J ~)i , ( l +
) - log -15
7r2
(3-2v3)
e) Show that
j,logrM+l)121
5(3) = Li3(l) = 10
t2 coth (t) dt
a)
x--
a-x
(X- n)(a - n)
383
a) Show that
cot (TZ) =
2~iz
+ riz
e2rriz- 1
c)
Show that
d) Thus
15.
(Evaluation of P(2m
(11.3.62)
+ 1))
-=
cos z
n=o
b) Show that
(2n)!
1.1 < 5 .
where, as before,
c)
Show that
d) Thus
)(:
a) Show that
Show that
--
x
-- x2 2 + x arcsin
2 312
1-x
(1-x)
'
2 ( - ~ ) ~ --l 1
--
v3+1
log 25
2
e)
Show that
and that
Thus
and that
( 2 n + l)l6"
=-
'
Bibliography
[I8811 N. H. Abel, Oeuvres Complktes (Grondahl and Son, Christiania, 1881).
1641 M. Abramowitz and I. Stegun, Handbook of Mathematical Functions (Dover, New
York, 1964).
W.
W. Adams, "Asymptotic Diophantine Approximation to e," Proc. Natl. Acad. Sci.
[66]
55 (1966), 28-31.
[66] L. V. Ahlfors, Complex Analysis, 2nd ed. (McGraw-Hill, New York, 1966).
[74] A. V. Aho, J. E. Hopcroft, and J. D. Ullman, The Design and Analysis of Computer
Algorithms (Addison-Wesley, Reading, MA, 1974).
[79] K. Alladi and M. L. Robinson, "On Certain Irrational Values of the Logarithm,"
Lecture Notes in Mathematics 751, (Springer, Berlin, 1966).
[Pr]
[prl
Ellipses, the Ladies Diary, m, and the American Revolution," MAA Monthly (in press).
G.
E. Andrews, The Theory of Partitions (Addison-Wesley, Reading, MA, 1976).
[76]
,
"An Introduction to Ramanujan's 'Lost' Notebook," MAA Monthly 86 (1979),
[79]
89-108.
"The Fifth and Seventh Order Mock Theta Functions," Trans. AMS 273 (1986),
[86] -,
113-134.
1741 T. M. Apostol, Mathematical Analysis, 2d ed. (Addison-Wesley, Reading, MA, 1974).
[76a] -,
Introduction to Analytic Number Theory (Springer, New York, 1976).
[76b] -,
Modular Functions and Dirichlet Series in Number Theory (Springer, New York,
1976).
[85] J. Arazy, J. Claesson, S. Janson, and J. Peetre, "Means and Their Iterations," in Proc.
19th Nordic Congress of Math., J.R. Stefhson (ed.), Reykjavik (1985).
[80] R. Askey, "Ramanujan's Extensions of the Gamma and Beta Functions," MAA
Monthly 87 (1980), 346-359.
[81] R. Backstrom, "On Reciprocal Series Related to Fibonacci Numbers," Fibonacci Quart.
19 (1981), 14-21.
D.
H. Bailey, "The Computation of rr to 29,360,000 Decimal Digits Using Borweins'
[88]
Quartically Convergent Algorithm", Math. Comput. 50 (1988), 283-296.
[35] W. N. Bailey, Generalized Hypergeometric Series (Cambridge Univ. Press, London,
1935).
A.
Baker, "Approximation to the Logarithms of Certain Rational Numbers," Acta
[64]
Arith. 10 (1964), 315-323.
Transcendental Number Theory (Cambridge Univ. Press, London, 1975).
[75] -,
[81] G. A. Baker and P. Graves-Morris, Pad&Approximants, vols. I and I1 (Addison-Wesley,
Reading, MA, 1981).
388
Bibliography
Bibliography
389
Bibliography
Bibliography
391
Bibliography
392
Bibliography
393
-, "Application
of Some Formulae by Hermite to the Approximation of Exponentials and Logarithms," Math. Ann. 168 (1967), 200-227.
D. Masser, "Elliptic Functions and Transcendence," Lecture Notes in Mathematics 437
(Springer, Berlin, 1975).
G. Miel, "On Calculations Past and Present: The Archimedean Algorithm," M A A
Monthly 90 (1983), 17-35.
M. Mignotte, "Approximations Rationalles de ?r et Quelques Autres Nombres," Bull.
Soc. Math. France, Mem. 37 (1974), 121-132.
D. S. Mitrinovic, Analytic Inequalities (Springer, New York, 1970).
L. J. Mordell, "Note on Class Relation Formulae," Messenger of Math. 45 (1916),
75-80.
G. Nemeth, "Relative Rational Approximation of the Function ex," Math. Notes 21
(1977), 325-328.
R. Nevanlinna and V. Paatero, Introduction to Complex Analysis (Addison-Wesley,
Reading, MA, 1969).
D. J. Newman, "Approximation with Rational Functions," AMS Regional Conf. Ser. in
Mathematics, no. 41 (1979).
-,
"Rational Approximation Versus Fast Computer Methods," Lectures on Approximation and Value Distribution (Presses de l'Universit6 de MontrBal, Montreal, 1982),
pp. 149-174.
-, "A Simplified Version of the Fast Algorithms of Brent and Salamin," Math.
Comput. 44 (1985), 207-210.
M. Newman and D. Shanks, "On a Sequence Arising in Series for T," Math. Comput.
42 (1984), 191-217.
I. Niven, "Irrational Numbers," Carus Math. Monograph 11 (MAA, 1956).
M. Nyvoll, 'Tilnoermelsesformler for Ellipsebuer," BIT 25-26 (1978), 70-72.
J. Oxtoby, Measure and Category, 2d ed. (Springer, New York, 1980).
G. M. Phillips, "Archimedes the Numerical Analyst," M A A Monthly 88 (1981),
165-169.
-, "Archimedes and the Complex Plane," M A A Monthly 91 (1984), 108-114.
Bibliography
395
Symbol List
Symbol
Description
Arithmetic-Geometric Mean Iteration
Section
Formula
1.1.1 and
1.1.2
1.1.5
1.1.11
Complementary modulus
1.3.3
Ef(k):= E(kl)
1.3.4
Hypergeometric function
1.3.5
.. :(2i-1)
(2i-1)!!:=1-3.5.
1.4.4
1.4.5
( x ) :=
e t
Gamma function
1.6.4
Symbol
Description
( x , ) :=
1-t
)d ,
Beta function
Weierstrass function
r2(n):=number of representations of n
as a sum of two squares
Oi(s):= Oi(q)
where
q := e-""
k ( q ) := k = '3;(q)1'3:(q)
k 1 ( q ):= k' = 0 : ( q ) 1 ' 3 : ( ~ )
W )=
639)
= e-K'(k)IK(k)
k(s) := k ( q )
where
q := e-"
A*@):= k ( q )
where
q = e-&
Section
Formula
Symbol
Description
Section
3.2
3.2
q 1 1 1 2 ~ o , Eta function
3.2
q-lIz4~3
3.2
v9q"'2~l
3.2
q-1'24~z
3.2
3.2
3.2
3.2
3.3
3.3
3.5
3.5
w(n) := ul(n)
+ ul(odd (n))
3.5
9-1
S(p, q) := C,
e-""Z~'4
3.5
r=o
M( f ) := M*(f ) :=
f(x)*-ld.
Mellin transform
3.6
c,
[(s) :=
C n-'
n=l
L(p):=
pn
x- -
"=I 1- p n ,
3.6
3.6
(PI < 1 ,
Fo:=O,Fl:=l,F,,+,:=F,+F ,-,,
Lambert series
Fibonaccinumbers
3.7
3.7
(Ex. 3)
LO:=2,L,:=l,L,+,:=L,+L
,-,,
Lucasnumbers
3.7
X := { i m ( ~>) 0)
X*:= X U {im)U {Q)
4.3
4.3
4.3
4.3
4.3
4.3
Formula
Symbol
Description
Section
4.3
(Ex. 1)
4.3
( E x . 1)
4.3
(Ex.1)
4.3
(Ex. 1)
J(t) := -
- A(*)
h2(t)(l - ~(t))' '
+
27
{(:
4.3
:):ad-bc=p;a,b,c,dEZ
transformations of order p
ji:=joAi
r=o
n , ( v , U ) :=
I1 (V - u ; )
i=O
Mp(l, k ) : =
O'(q) K(k)
-e ; ( q l l p ) K(1) '
Multiplier of order p
4.6
Formula
Symbol
Description
Section
~ ~ (k )1: =
, -.pkk"
d M p (I, k ) + M;'(I, k)12- pk2
MP(Lk ) dk
a( p) := RP(k1,k ) ,
k := e-+
5.2
m,, := ~ ; ' ( k , , ,k + , )
:= [rn,r,
+ m : ( l + k:) - p ( l +
5.4
5.4
k:+,)]/3
5.4
a,,,, := m i a , - pnv%,,
T(a + n )
(a)" := -= a(a + 1) . . . (a + n - I ) ,
r(a)
G :=
.=o
b , = fl(a,)
.l.Z!L
(2n + 1)' '
5.4
5.4
r" := RP(k">k"+J
E,
5.2
rising factorial
Catalan's constant
if b, = O(a,)
and
a, = O(b,)
5.5
5.6
(Ex. 10)
6.1
Bit complexity
6.1,
Operational complexity
6.1
6.2
6.4
6.3
(Ex. 3)
6.4
6.4
a A b := min(a, b )
8.1
a v b :max(a, b )
8.1
8.1
- {O)
Mf(a, b ) := f - ' ~ ( f ( a ) f@))
,
Mp denotes Mf when f(x) = xP
8.1
8.1
8.1
Formula
Symbol
Description
a
b)=
aP +
bP
Section
'/P
a, b > 0.
Holder means
Lehmer means
Stolarsky means
3 ( a , b ) := &(a, b ) ,
Logarithmic mean
$(a, b ) := &(a, b ) ,
Identricmean
dominance
M ( 4 ( a ) , 4 ( b ) ) = (P(N(a,b ) ) ,
if M > 4,N and N > &M,
equivalence
Compound Mean of M and N
M comparable to N
Cij:=Fi@q
b, a v b)
(i, j=1,2,3,4)
Formula
Symbol
Description
Section
8.7
L p ( 4 :=
H i ) :=
Lehmer means
(i
a,)/(:
ar-q ,
,=I
,=I
(N 5 a ) ' , HOlder means
8.7
8.7
i=1
8.7
<+,= M(c,,)
8.7
8.7
8.7
He(a, c) := a
8.7
Heronian mean
8.7
8.7
8.8
9.2
9.2
9.2
9.2
9.2
9.2
(Ex. 6)
9.4
(Ex. 7 )
MJk"
>
9.5
kpd
Np := v2(q)1v2(q1'p) ,
~,:={z@IIzI56}
given by 3.2.9
9.5
10.1
10.1
10.1
10.1
Formula
Description
Symbol
= p ~I g
l f -nP / ~ ~ I A
R n ( f ,4
R , ( f , A ) := R J f )
y := l h [ 1 + 2
dn
[ao,a , , . . . ,a,,]
1
[a,, a , , . . . ,a,] := a, + ---1
a, + . a2 +
1
+ -13 + - - + m - log m ] ,
Euler's constant
Section
Formula
10.1
10.1.3
10.2
10.2.20
11.3
11.3
(Ex. 2 )
lim [a,, a , ,
n--
. . ., a , ]
11.3
(Ex. 2 )
4
B,=--
and
B,,+,:=O,
2
Bernoulli numbers
1
(-1)"~,,2"
cos z ,=,
(2n)!
Euler numbers
-=
n=l,2,3,
?T
IZI< z'
Author Index
Numbers in parenthesis indicate references in the Bibliography. "Pr" indicates references in press.
Abel, 5, 28
Abramowitz and Stegun (64), 15, 321
Adams (66).37 1
Aho, Hopcroft and Ullman (74), 201, 203, 206,
21 1
Alladi and Robinson (79), 371
Almqvist (Pr), 99
Almqvist and Berndt (Pr), 3, 196, 371
Andrews (76), 67, 86
Andrews (86). 286
Apery, 365, 37 1, 379
Apostol (74), 37, 40
Apostol (76a). 86
Apostol (76b), 116
Arazy et al. (85), 269, 272
Archimedes, 243, 249, 337, 338
Aristophanes, 347
Askey (go), 89, 308
Backstrom @I), 98, 99
Bailey, D. (88), 211, 341, 342
Bailey, W., 181
Bailey, W. (35), 184
Baker, 323, 348
Baker (75), 349, 351, 354, 356, 363, 371
Baker and Graves-Morris @I), 320, 323
Ballantine (39). 347
Baxter, 80
Beckenbach (50). 236, 269
Beckmann, 346
Beckmann (77). 342
Beeler et a/. (72). 220, 222
Bell (27), 36
Bellman (61), 39, 55, 56, 86, 89
Benson, 301,302, 303
Berndt (Pr), 142, 150, 177, 196, 309
Author Index
Bring, 136
Bromwich (26), 189, 291, 344
Brouncker, 338, 345
Bundschuh (71), 371
Buslaev, Gonchar and Suetin (84), 323
Carlitz (71), 71
Carlson, 248, 257, 261
Carlson (71), 7, 256
Carlson (73, 261
Carlson (78). 262
Catalan, 66
Cauchy, 76, 77, 292
Cayley (1874), 106, 133, 138, 140, 142
Cayley (1895). 22, 102, 138
Chandrasekharan (85). 116
Cheney (66), 321
Chudnovsky and Chudnovsky (84). 368, 371
Clark (7 I), 349
Clausen, 179, 188, 189, 190
Cohen and Nussbaum (87), 273
Comtet, 386
Cook, 69, 213
Cook and Aanderaa (69), 69
Cooley, Lewis and Welch (67), 65
Cooley and Tuckey (65), 65
Cox (85), 15
Dase, 340
Davenport (8 1 ), 226
Davis (79), 365, 37 1
Denninger (84). 89
Dickson (29), 293
Dickson (71), 55, 66, 85, 141, 284, 287, 292
Dirichlet, 86
DuVal (73), 23, 30, 133
Eagle (58), 29
Edwards (79), 252
Epstein, 303
Erdelyi er a/. (53), 10, 30, 55, 178, 179, 181
Euler, 3, 18, 64, 67, 77, 189, 264, 286, 306,
307, 340, 344, 346, 348, 381, 382, 383
Ewell @I), 66
Ewell (82), 306
Ewell (83), 86
Ewell (86), 151
Feldman, 371
Felton, 341
Ferguson, 340
Fermat, 82, 286
Foster and Phillips (84a). 260
Foster and Phillips (84b). 247, 252
Gammel, 323
Gauss, 1, 3, 5, 8, 10, 28, 43, 51, 85, 86, 250,
256, 284, 286
Gauss (1866). 5, 7, 35, 44, 48, 52, 65
Gelfond, 348
Gini (38), 233
Glasser and Zucker (go), 68, 290, 293, 303
Goldfield (85), 295
Gordon (61). 306, 308
Gosper, 341
Gould and Mays (84). 263
Goursat (l881), 185, 186
Gradshteyn and Ryzhik (go), 10
GravC, 345
Graves-Morris, 323
Greenhill (1892), 133, 167
Gregory, 339, 348, 381
Guilloud, 341
Hancock (09), 29
Hanna (28), 133
Hardy, 80, 187
Hardy (40). 141, 314
Hardy, Littlewood and Polya (59). 235, 242,
269
Hardy and Wright, 80
Hardy and Wright (60). 66, 67, 85, 86, 95,
281, 284, 349, 376, 378
Hermite, 135, 264, 348, 359
Hermite and Stieltjes (05). 264
Hilbert, 348. 353, 354
Hiibert (1893). 348
Hirschhorn (85), 285
Holloway, 188
Householder (70), 2 15, 2 16
Hua (82), 294, 349, 375
Hughes (84), 175
Jacobi, 5, 10, 20, 22, 28, 38, 52, 65, 68, 73,
74, 85, 106, 143, 180, 252, 261, 284, 301,
306
Jacobi (18291, 36, 106
Jones, 339
Joubert, 142
Kaltofen and Yui (84), 133
Kanada, 224, 225, 341
Karatsuba, 21 1
Keith (86). 342
King (24), 16, 18, 44, 52, 59, 60
Kiein, 116
KIein (1897), 349
Author Index
Klein (79), 7
Klein and Fricke (1892), 116, 126
Knuth @I), 203, 21 1, 212, 218
Kronecker, 135, 293, 296, 297
Kummer, 179
Kung and Traub (78), 216
Lagrange, 3, 82, 321, 375
Laguerre, 99
Lambert, 99, 348
Landau, 268, 269
Landau (1899), 94
~ a n d a u(58), 86, 294
Landen, 381
Lang (66), 349
Lang (73), 29, 30, 116, 123
Leach and Scholander (78), 237
Legendre, 3, 10, 23, 26, 152, 153, 154, 178,
348
Lehmer, 264, 265
Lehmer (38), 345
Lehmer (71), 247, 263
Lehmer (85), 386
Lehner (66), 1 16
Leibniz, 339, 340
Le Petit Archimede (80), 342
LeVeque (77), 294, 349
Lewin (81), 382, 386
Lindemann, 274, 348
Lindemann (1882), 348
Liouville, 10, 36, 116, 348
Lipson (81), 216
Lorenz, 287, 289
Machin, 339, 341, 344
MacMahon, 67
Mahler (3 I), 359
Mahler (53). 368
Mahler (67), 349, 368
Meil (83), 250
Meinardus, 321
Metropolis, 341
Mignotte (74), 368
Mobius, 100
Mollerup, 89
Mordell, 187
Mordell (l6), 286
Nemeth (77), 321
Newman (79), 32 1
Newman (82), 7, 222, 223
Newman (85), 7
Newman, M., and Shanks (84). 193
Newton, 123, 207, 208, 212, 339, 343, 356
Padt, 320
Pfaff, 250
Phillips (8 I), 252
Picard, 119
PoincarC, 1 16
Poisson, 38
Polya, 86
Rademacher (73), 56, 71, 85, 90, 286
Rainville (60), 179
Ramanathan (84), 8 1
Ramanujan, 69, 80, 81, 84, 100, 138, 140, 144,
146, 150, 154, 157, 158, 161, 164, 179, 184,
186, 187, 188, 194, 195, 197, 287, 295, 306,
307, 308, 309, 310, 311, 312313, 314, 315,
386
Ramanujan (14). 158, 163, 179, 184, 186, 191,
193, 195, 313
Ramanujan (62), 80, 187, 28 1, 286
Rankin (77), 116
Reitwiesner, 341
Ribenboim (84), 345
Ribenboim (85), 95
Riemann, 89
Ritt (48), 226
Rogers, 80
RouchC, 317, 319
Salamin, 51, 220, 222
Salamin (76), 48, 49, 51
Sasaki, 225
Sasaki and Kanada (82), 224, 225
Schepler (50), 342
Schlafli, 147, 297
Schlomilch, 268, 269
Schneider, 348
Schoeneberg (76), 116, 123, 126
Schoeneberg (77), 247, 269
Schoeneberg (82), 247, 250
Schonhage, 21 l
Schonhage and Strassen (7 I), 2 11
Schroter, 11 1, 133
Schur, 80
Schwab, 250
Selberg and Chowla (67). 296
Shanks, D. (82). 193, 194
Shanks, D. and Wrench (62), 341
Shanks, W ., 340
Shanks, W. (1853), 340
Author Index
Singmaster (85), 342
Slater (66), 178, 179
Stickel (85), 222, 224
Stieljes, 264
Stolarsky (75), 233
Stolarsky (go), 237
Stormer, 345
Strassen, 202, 21 1
Sweeney (631, 336
Tamura and Kanada (Pr), 2 11, 342
Tannery and Molk (1893), 138, 141
Todd (49), 345
Todd (75), 394
Todd (79), 7
Toom, 211
Trefethen (84), 321, 325
Tricomi (65), 262
van Ceulen, 342
van der Poorten, 379
Vieta, 338, 343
von Neumann. 341
Wagon (85), 342
Waldo, 342
Subject Index
Subject lndex
e:
irrationality of, 352
transcendence of, 353, 359
Elementary symmetric mean function, 270
Elementary symmetric polynomial, 356
Ellipse, arc length of, 8
Elliptic functions, 29, 30, 3 1
cn, 29
degenerate elliptic functions, 29
dn, 29
fundamental parallelogram, 29, 53
Jacobian, 29
lattice, 29
odd elliptic functions, 31
order, 30
period of, 29
sn, 29, 57
Elliptic integrals
(see Complete and Incomplete)
Equivalence of operations, 2 13
Equivalent iterations, 252
Equivalent mean, 23 1, 239
Estimates of e , 363
Eta-multiplier, 3 11, 3 14
of order p. 3 11
Euler-Mascheroni constant, 336
Euler numbers, 383
computation of, 336
Euler's addition theorem, 18
Euler's formula, 189
Euler's identity, 64
Euler's pentagonal number theorem, 64, 66, 80,
306
Euler's totient function, 301
Evaluation problem, 204
Exponential:
algorithms for, 227
approximations to, 317, 318, 320, 321, 322,
323, 327, 330, 33 1, 335
series for. 320
Exp(n), quadratic algorithm for, 50
Factorial calculation, 218
Fast base conversion, 2 18
Fast computation, 50
Fast Fourier transform (FFT), 204,206.21 1, 212
Fast matrix multiplication, 202
Fast multiplication, 209
Fast polynomial division, 207
Fast polynomial evaluation, 208
Fast polynomial multiplication, 206
Fibonacci numbers, 94, 151, 287, 375
Fibonacci sequences and series, 91, 97, 98, 100
Finite Fourier transform, 204
Fully monotone, 29 1
Fundamental limit theorem, 5
first proof, 5
second proof, 6
third proof, 12
fourth proof, 21
Fundamental regions, 113
Fundamental sets, 1 13
Fundamental unit, 294
Galois group:
for F,, 123
for W, and W,, 136
Gamma function, 24, 27, 28, 87, 332, 336
duplication formula, 88, 90
evaluation of K and E , 25, 27, 189, 191, 297,
298
Subject Index
Gauss's multiplication formula, 90
Gaussian binomial coefficients, 76
Gaussian mean iterations, 246, 249, 251, 252
Gaussian product, 246, 248, 253
Gaussian sums, 83, 86
Gelfond-Schneider theorem, 277, 278, 348
Generalized complete elliptic integrals:
of first kind, 178
of second kind, 178
transformations, 179, 180, 185
Generalized Legendre symbol, 293
Generalized singular value function, 185
Genera, 295
Geometric series, 91
Gini's means, 233, 264
Gregory's series, 339
Halley's method, 216
Height, 371
Heronian mean, 237, 269, 270
Hexagonal lattice, 292
Hexagonal sum, 292
Holder's means, 232, 235, 263, 264
Homogeneous mean, 23 1
Homogeneous multidimensional mean, 266
Homer's rule, 204
Huwitz zeta function, 303, 304
Hyberbolic arclemniscate, 259
Hyberbolic function identities, 74, 75, 100
Hypergeometric functions, 332-334
Hypergeometric functions and series:
Gaussian hypergeometric series, 8
generalized, 178
hypergeometric differential equation, I I
Identric mean, 234
Incomplete elliptic integrals, 10
Eulers addition theorem, 18
of first kind, 10, 58, 60
of second kind, 10
of third kind, 10
Inhomogeneous modular group, T, 1 13, 1 17
Interpolation problem, 204
Invariance principle, 245, 246, 260
Invariants, evaluating, 293
Irrationality measures, 362
for exp(l/v), 363, 364, 371, 377
for exp(n), 371
for log, 370, 379
for n , 367, 368, 371
for n Z , 367
for 5(3), 369, 370
Irrationality of e. 352
Irrationality of n , 352
Isotone mean, 23 1
Isotone multimimensional mean, 266
Jacobian elliptic function, 29
cn, 29
dn, 29
half angle formula for cn, 262
half angle formula for sn, 30
sn, 29, 57
Jacobi's differential equation, 20, 22
Jacobi's duplication formula for arcsl, 260
Jacobi's formula for r4(n), 81
Jacobi's identity, 35
Jacobi's imaginary quadratic transformation, 73,
180
Jacobi's triple-product identity, 62, 65. 66, 72,
301, 306
corollaries, 64, 65
finite form, 77
first proof, 62
Khinchin's theorem, 362, 371
Klein's absolute invariant, 115, 179
Kronecker symbol, 293
Kummer's indentity, 179
Lagrange interpolation formula, 321
Lambert series, 91, 99, 281, 286, 288, 300
Landen transform, 17, 57, 59
in terms of elliptic functions, 58
in terms of theta functions, 57
Laplace transform, 39
Lattice sums, chemical, 288, 290
Legendre's relation, 24, 26, 27, 49, 152, 153
generalized, 178
theta function proof, 43
Lehmer's means, 232, 235, 263
Lemniscate sine, 5, 28
Length, 358
Lengths and measures, 358
Lindemann's theorem, 348, 357
Liouville numbers, 35 1 , 352
Liouville's function, 100
Liouville:~principle, 54
Liouville's summation principle, 36
Logarithm:
approximations to, 325, 327, 328, 330
bit complexity, 219, 222
calculation, 220, 221, 222, 223
complex, 222
operational complexity, 219, 222
series for, 320
theta function algorithms for, 224
Subject Index
Logarithmic mean, 234, 248, 261
Lucas numbers, 95, 97, 100, 287
Machin's formula, 339, 340, 341, 344
Madelung's constant, 288, 292, 301
Matrix logarithm, 224, 225, 226
Mean, 230, 231
discontinuous, 230, 238
equivalent mean, 23 1
homogeneous, 23 1, 266
isotone, 23 1, 266
strict, 230, 266
symmetric, 231, 266
trace of, 23 1
Mean iterations, 243
algebraic, 273-280
Archimedean, 246
arithmetic-harmonic, 4
Carison's, 257
Gaussian, 246
harmonic-geometric, 4
Lehmer's 265
multidimensional, 267
pathology, 253
rates of convergence, 251, 267, 272
Schlomilch's, 268
Tricomi's, 265
Means:
classes:
Gini, 233, 264
Heronian, 237, 269, 270
Holder, 232, 235, 263, 264
identric, 234
Lehmer, 232, 235, 263
logarithmic, 234, 248, 261
Neo-Pythagorean, 255
series expansions of, 263-266
Stolarsky, 233, 236, 264
Measure, of polynomial, 359
Meinardus conjecture, 32 1
Mellin transform, 87, 90, 289, 290, 301
Modular equations, 103, 3 15
cubic, 104, 107, 109, 110
degree of, 125
of degree 15, 3 I4
of degree 23, 133
elliptic, 112
endecadic, 106
for j, 123
for K , , and K,,4, 315
for A, of order p, 121, 140
octicity of u-v form, 126, 134
quadratic, 109
quintic, 105, 107, 109, 135, 136, 297, 313
Subject Index
general iteration, 169
irrationality, 348, 352
irrationality measure, 367
legislation on, 342
mnemonic for, 342
normality of, 342
quadratic algorithm for e x p ( ~ )50
,
quadratic iteration, 170, 174
quartic iteration, 170, 174
quintic iteration, 175, 3 13
septic iteration, 171, 174, 175
series for I/T, 181-190
transcendence of, 347, 348, 352, 354
Picard's theorem, 1 19
Piecewise monotone, 234
Pochhammer symbol, 178
Poisson summation formula, 36, 89
Prime number theorem, 378
Primitive binary form, 141
Properly equivalent forms, 141
Q-binomial coefficients, 76
Q-binomial theorem. 308
Quadratically attractive transformation, 277
Quadratic computation, 50
Quadratic modular equation, 102
Quadratic reciprocity, 86
Quintic multipliers, 309
Quintuple-product identity, 143, 146, 306
Ramanujan's continued fraction identity, 8 1
Ramanuian's invariants. 70. 179
Ramanujan's modular identity:
of order 3, 287
of order 5, 310, 312
of order 7, 312
sum, 308
Ramanujan's ,P,
Ramanujan's multiplier of second kind, 157, 159
Rational mean iteration, 278
Recursion, 201
Recursion formula for r2(n), r3(n), 15 1, 306, 307
Reduced complexity methods, 327
acceleration based on:
binary splitting, 329
on FFT, 328
on functional equations, 327
Reversion of power series, 208
Riemann hypothesis, 90
Rising factorial, 178
Rogers-Ramanujan identities, 65, 78, 80
Bressoud's "easy proof," 78
Roth's theorem, 363
Schlafli's equation, 147
413
Schonhage-Strassen multiplication, 21 1
Schroter's formula, I I I
Schwarz derivative, 20, 118
Septic multipliers, 309
Simple continued fractions, 372
Singular invariants, 141
Singular moduli, 139
Singular value function, 152
of first kind, 152
generalized, I85
of second kind, 152ff
Singular values, 26.49, 139, 140, 141, 172, 173,
296, 297
Solvability of quintic in modular terms, 135
Solvable modular equations, 310, 31 1, 313
Solvable numbers:
of type one, 293
of type two. 293
Square-free invariants, 295
Stirling numbers, 305
Stirling's formula, 90
Stolarsky's means, 233, 236, 264
identric, 234, 271
logarithmic, 234, 271
multidimensional, 271
Strict mean, 230
Strict multidimensional mean, 266
Strong equivalence, 241, 243
of iterations, 252
Sums of squares:
of two squares, 82, 285, 290
of three squares, 66, 151, 286
of four squares, 81
of others, 71, 287, 292, 293
Supremum norm, 316
Symmetric mean, 231
Symmetric multidimensional mean, 266
Symmetric polynomial, 356
Tan, computation of, 227
Theta functions, 10, 33, 91, 93
basic identities, 64,67, 68,70,71, 73,74, 1 I I
finite transformation, 86, 87
general theta functions, 52
one-dimensional heat equation, 56
theta transformation formulae, 38, 44, 54, 87
Theta series, 91
Trace of mean, 231
Transcendence of e, 348, 353, 359
Transcendence of T, 347, 348, 352, 354
Transcendental functions, 274
Transcendental number, 35 1, 352
Transformation of order p. 119
Transformations of complete elliptic integrals:
414
Subject index