Eigenvalues and Eigenvectors: 5.1 What Is An Eigenvector?
Eigenvalues and Eigenvectors: 5.1 What Is An Eigenvector?
Eigenvalues and Eigenvectors: 5.1 What Is An Eigenvector?
Eigenvalues and
Eigenvectors
173
174 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
vector and characteristic value are also used; sometimes the word “proper” is
substituted for characteristic. We formalize this discussion with the following:
L[g1 ]
g1
g2
L[g2 ]
Figure 5.1
Definition 5.1. Let L be a linear transformation that maps a vector space into
itself. A nonzero vector x is called an eigenvector of L if there is a scalar λ such
that L(x x ) = λx
x. The scalar λ is called an eigenvalue of L and the eigenvector
is said to belong to, or correspond to, λ.
OK, we know what we want, eigenvectors. How do we find them? Let’s
examine a 2 × 2 matrix. Let
2 6
A=
1 3
and suppose that A is the matrix representation of a linear transformation L
with respect to the standard basis. Thus, for any x = (x1 , x2 ) we have
2 6 x1 2x1 + 6x2
x
L(x ) = =
1 3 x2 x1 + 3x2
We want to find those numbers λ for which there is a nonzero vector x such
x) = λx
that L(x x . Thus,
x x
A 1 =λ 1
x2 x2
or
x1 0
(A − λI2 ) =
x2 0
5.1. WHAT IS AN EIGENVECTOR? 175
Hence, we are looking for those numbers λ for which the equation (A−λI2 )x x =0
has a nontrivial solution. But this happens if and only if det(A − λI2 ) = 0. For
this particular A we have
2−λ 6
det(A − λI2 ) = det = (2 − λ)(3 − λ) − 6
1 3−λ
= λ2 − 5λ = λ(λ − 5)
The only values of λ that satisfy the equation det(A − λI2 ) = 0 are λ = 0 and
λ = 5. Thus the eigenvalues of L are 0 and 5. An eigenvector of 5, for example,
will be any nonzero vector x in the kernel of A − 5I2 .
In the following pages when we talk about finding the eigenvalues and eigen-
vectors of some n×n matrix A, what we mean is that A is the matrix representa-
tion, with respect to the standard basis in Rn , of a linear transformation L, and
the eigenvalues and eigenvectors of A are just the eigenvalues and eigenvectors
of L.
Example 1. Find the eigenvalues and eigenvectors of the matrix
2 6
1 3
From the above discussion we know that the only possible eigenvalues of A are
0 and 5.
λ = 0: We want x = (x1 , x2 ) such that
2 6 1 0 x1 0
−0 =
1 3 0 1 x2 0
2 6
The coefficient matrix of this system is , and it is row equivalent to the
1 3
1 3
matrix . The solutions to this homogeneous equation satisfy x1 = −3x2 .
0 0
Therefore, ker(A − 0I2 ) = S[(−3, 1)], and any eigenvector of A corresponding
to the eigenvalue 0 is a nonzero multiple of (−3, 1). As a check we compute
−3 2 6 −3 0 −3
A = = =0
1 1 3 1 0 1
0 0 0 4
4). The eigenvalue 2 has multiplicity 3 while the eigenvalue 4 has multiplicity
1.
One last bit of terminology: When we wish to talk about all the eigenvectors
associated with one eigenvalue we use the term eigenspace. Thus, in Example 2
the eigenspace of the eigenvalue (−1) is just ker(A − (−1)I). In general, if L is
any linear transformation from a vector space into itself and λ0 is an eigenvalue
of L, the eigenspace of λ0 is ker(L − λ0 I). That is, the eigenspace of λ0 consists
of all its eigenvectors plus the zero vector. Note that the zero vector is never an
eigenvector.
We’ve seen how to compute the eigenvalues of a linear transformation if the
linear transformation is matrix multiplication. What do we do in the more
178 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
By p(A) we mean
21. Show that det(A−λI) = det(AT −λI) for any constant λ. Thus, A and AT
have the same characteristic polynomial, and hence the same eigenvalues.
22. Find a 2×2 matrix A for which A and AT do not have the same eigenspaces.
23. Find the eigenvalues, their multiplicities, and the dimensions of the cor-
responding eigenspaces for each of the following matrices:
1 1 0 0 1 1 0 0 1 1 0 0
0 1 0 0 0 1 1 0 0 1 1 0
a.
0 0 1 0
b.
0 0 1 0
c.
0 0 1 1
0 0 0 1 0 0 0 1 0 0 0 1
−1 3
24. Let A = . Define L : M22 → M22 by L[B] = AB for any B in
1 1
M22 . Find the eigenvalues of L, their multiplicities, and the dimensions
of the eigenspaces.
25. Let A be any n × n matrix. Define L : Mnn → Mnn by L[B] = AB. Show
that λ is an eigenvalue of L if and only if λ is an eigenvalue of the matrix
A. Remember that λ is an eigenvalue of A only if there exists a nonzero
x in Rn such that Ax x = λxx.
´t
26. Define L : P2 → P2 by L[pp](t) = (1/t) 0 p (s)ds. Find the eigenvalues,
their multiplicities, and the dimensions of the eigenspaces.
1 3 1 0
27. Let A = . Let B = . Define L : M22 → M22 by
0 1 0 6
L[xx] = Axx + x B. Show that L is a linear transformation, and then find
its eigenvalues, their multiplicities, and the corresponding eigenspaces.
182 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
0 = L[c1f 1 + · · · + cpf p ]
= c1 L(ff 1 ) + · · · + cp L(ff p )
= c1 λ1f 1 + · · · + cp λpf p (5.4)
Thus, the roots of p(λ) are −1, 1, 2, and 3. Since they are real and distinct,
Theorem 5.3 guarantees that R4 will have a basis that consists of eigenvectors
of L. We next find one eigenvector for each of the eigenvalues.
λ = −1:
−2 0 2 −4
−6 3 2 −5
A − (−1)I = 4 0
0 4
6 0 −2 8
is row equivalent to
1 0 0 1
0 1 0 1
0 0 1 −1
0 0 0 0
Thus, ker(A + I) = S[(−1, −1, 1, 1)], and f 1 = (−1, −1, 1, 1) is an eigenvector
corresponding to −1.
5.2. DIAGONALIZATION OF MATRICES 185
λ = 1:
−4 0 2 −4
−6 1 2 −5
A−I = 4 0 −2
4
6 0 −2 6
1 0 0 1
0 1 0 1
is row equivalent to the matrix
0 0 1 0. Thus, f 2 = (−1, −1, 0, 1) is an
0 0 0 0
eigenvector corresponding to the eigenvalue 1.
λ = 2: The matrix A − 2I is row equivalent to the matrix
1 0 0 1
0 0 1 −1
0 0 1 1
0 0 0 0
It is clear from some of our previous examples that rather than having dis-
tinct eigenvalues it is possible that some eigenvalues will appear with multi-
plicity greater than 1. In this case, Theorem 5.3 is not applicable, and we use
Theorem 5.4, whose proof is omitted.
Theorem 5.4. Let V be an n-dimensional real vector space. Let L : V →
V be a linear transformation. Let p(λ), the characteristic polynomial of L,
equal (λ − λ1 )m1 (λ − λ2 )m2 . . . (λ − λp )mp . Assume each of the roots λj , 1 ≤
j ≤ p is real. Then L has a diagonal matrix representation if and only if
186 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
dim(ker(A − λj I)) = mj for each of the eigenvalues λj ; that is, the number of
linearly independent solutions to the homogeneous equation (A−λj I)x x = 0 must
equal the multiplicity of the eigenvalue λj .
This last example is a special case of a very important and useful result.
Before stating it, we remind the reader that a matrix A is said to be symmetric
if A = AT ; cf. Example 2b.
5.2. DIAGONALIZATION OF MATRICES 187
Solution. We note that A is symmetric and hence Theorem 5.5 guarantees that
there is a basis of R3 that consists of eigenvectors of A. The characteristic
polynomial of A, p(λ), equals (2 + λ)2 (7 − λ). Thus, L has two real eigenvalues
−2 and 7; −2 has multiplicity 2 and 7 has multiplicity 1. Computing the
eigenvectors we have:
λ = −2 : A + 2I is row equivalent to the matrix
1 1 −1
0 0 0
0 0 0
Thus, (1, 1, −1) is a basis for ker(A− 7I) and the set {(1, 1, −1), (1,0,1) (0, 1, 1)}
is a basis for R3 . Clearly the matrix representation of L with respect to this
basis is
7 0 0
0 −2 0
0 0 −2
There is one more idea to discuss before we conclude this section on diago-
nalization. Suppose we start out with a matrix A (the representation of L with
respect to the standard basis of Rn ), and after calculating the eigenvalues and
eigenvectors we see that this particular matrix has n linearly independent eigen-
vectors; i.e., Rn has a basis of eigenvectors of A. Suppose the eigenvalues and
eigenvectors are {λj : j = 1, . . . , n} and F = {ff j : j = 1, . . . , n} respectively.
Then the matrix representation of L with respect to the basis F is [λj , δjk ] = D.
How are A and D related? The answer to this has already been given in Sec-
tion 3.5; for if P is the change of basis matrix that satisfies [x x]TS = P [xx]TF , then
188 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
P is easy to write out, for its columns are the coordinates of the eigenvectors
with respect to the standard basis. Moreover, we have
A = P DP −1 (5.5)
or
D = P −1 AP
Note that in the formula D = P −1 AP , the matrix that multiplies A on the right
is the matrix whose columns consist of the eigenvectors of A. Note also that A
and D are similar; cf. Section 3.5.
Example 4. Let A be the matrix
1 3 −3
3 1 −3
−3 −3 1
Since A is symmetric, we know that it can be diagonalized. In fact, in Example 3,
we computed the eigenvalues and eigenvectors of A and got
λ1 = −2 f 1 = (1, 0, 1
λ2 = −2 f 2 = (0, 1, 1)
λ3 = 7 f 3 = (1, 1, −1)
Thus,
−2 0 0 1 0 1
D = 0 −2 0 and P = 0 1 1
0 0 7 1 1 −1
Computing P −1 we have
D = P −1 AP
2 −1 1 1 3 −3 1 0 1
1
= −1 2 1 3 1 −3 0 1 1
3
1 1 −1 −3 −3 1 1 1 −1
−2 0 0
= 0 −2 0
0 0 7
Formula (5.5) and the following calculations enable us to compute fairly
easily the various powers of A, once we know the eigenvalues and eigenvectors
of this matrix.
A2 = [P DP −1 ][P DP −1 ] = P D(P −1 P )DP −1 = P D2 P −1
In a similar fashion, we have
An = P Dn P −1 (5.6)
The advantage in using (5.6) to compute An is that D is a diagonal matrix and
its powers are easy to calculate.
5.2. DIAGONALIZATION OF MATRICES 189
1 3
Example 5. Let A = ; cf. Example 4 in Section 1.4. Compute An for
0 4
arbitrary n.
Solution. Computing the characteristic polynomial of A we get
1−λ 3
p(λ) = det = (λ − 1)(λ − 4)
0 4−λ
2. For each of the following matrices find a diagonal matrix that is similar
to the given matrix:
−1 0 8 2 4 2 1 2
a. b. c. d.
2 3 1 7 2 1 0 0
g(λ1 ) 0
g(A) =
0 g(λ2 )
11. If c is a nonnegative real number, we can compute its square root. In fact
cα is defined for any real number α if c > 0. Suppose A is a diagonalizable
matrix with nonnegative eigenvalues. Thus A = P DP −1 , where D equals
[λj δjk ]. Define Aα = P Dα P −1 , where Dα = [λα
j δjk ].
13. For each matrix in the preceding problem compute when possible A−1/6
and A2/3 .
14. Compute A1/2 and A−1/2 where A is the matrix in problem 5. Verify that
A1/2 A−1/2 = I.
5.3 Applications
In this section, instead of presenting any new material we discuss and analyze
a few problems by employing the techniques of the preceding sections. Several
things should be observed. One is the recasting of the problem into the lan-
guage of matrices, and the other is the method of analyzing matrices via their
eigenvalues and eigenvectors.
The reader should also be aware that these problems are contrived, in the
sense that they do not model (to the author’s knowledge) any real phenomenon.
They were made up so that the arithmetic would not be too tedious and yet the
flavor of certain types of analyses would be allowed to seep through. Modeling
real world problems mathematically usually leads to a lengthy analysis of the
physical situation. This is done so that the mathematical model is believable;
that is, we wish to analyze a mathematical structure, for us a matrix, and then
be able, from this analysis, to infer something about the original problem. In
most cases this just takes too much time—not the analysis, but making the
model believable.
Example 1. An individual has some money that is to be invested in three
different accounts. The first, second, and third investments realize a profit of 8,
10, and 12 percent per year, respectively. Suppose our investor decides that at
the end of each year, one-fourth of the money earned in the second investment
and three-fourths of that earned in the third investment should be transferred
into the first account, and that one-fourth of the third account’s earnings will
be transferred to the second account. Assuming that each account starts out
with the same amount, how long will it take for the money in the first account
to double?
192 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
Solution. The first step in analyzing this problem is to write down any equations
relating the unknowns. Thus let ajk , j = 1, 2, 3; k = 1, 2, . . . represent the
amount invested in account j during the kth year. Thus, if a dollars were
originally invested in each of the three accounts we have a11 = a21 = a31 = a.
Moreover, we also have
Our problem is to determine k such that a1k = 2a. Clearly the eigenvalues of A
are 1.08, 1.075, and 1. Routine calculations give us the following eigenvectors:
λ1 = 1.08 f 1 = (1, 0, 0)
λ2 = 1.075 f 2 = (−5, 1, 0)
λ3 = 1 f 3 = (−5, −2, 5)
1 5 3
1 −5 −5
2
Setting P = 0 1 −2 , we have P −1 = 0 1 5 . Thus, A =
0 0 5 1
0 0 5
1.08 0 0
P 0 1.075 0 P −1 and
0 0 1
a1k , which is the first component of the vector Ak−1 U1 , must equal
then certainly (5.11) will hold with the equality sign replaced by ≥. Now (5.12)
implies (1.075)k−1 = 23 . Thus k − 1 = (ln 3 − ln 2)/ ln 1.075 ≈ 5.607 or k ≈ 6.607
years. In other words the amount of money in the first account will certainly
have doubled after 7 years.
This example also illustrates another aspect of mathematical analysis, ap-
proximations. Equation (5.11) is extremely difficult to solve, while (5.12) is
much easier and more importantly supplies us with a usable solution.
Example 2. Suppose we have two different species and their populations dur-
ing the kth year are represented by xk and yk . Suppose that left alone the
populations grow according to the following equations:
xk+1 = 3xk − yk
yk+1 = −xk + 2yk
Our problem is the following. What percentage of each species can be removed
(harvested) each year so that the populations remain constant?
Solution. Let r1 and r2 denote the fractions of xk and yk that are removed. We
have 0 < r1 , r2 < 1, and at the end of each year we effectively have (1 − r1 )xk
and (1 − r2 )yk members left in each species. These comments imply
xk+1 3(1 − r1 ) −(1 − r2 ) xk
=
yk+1 −(1 − r1 ) 2(1 − r2 ) yk
Clearly this is impossible. Hence, we cannot remove any percentage and leave
the populations fixed. Well, maybe we asked for too much. Perhaps instead of
wanting our removal scheme to work for any population, we should instead try
to find a population for which there is some removal scheme. Thus, if A is the
matrix
3(1 − r1 ) −(1 − r2 )
−(1 − r1 ) 2(1 − r2 )
194 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
T T
we want to find numbers x and y such that A x y = x y . In other
words, can we pick r1 and r2 so that 1 not only is an eigenvalue for A but also
has an eigenvector with positive components. Let’s compute the characteristic
polynomial of A.
3(1 − r1 ) − λ −(1 − r2 )
p(λ; r1 , r2 ) = det
−(1 − r1 ) 2(1 − r2 ) − λ
= [λ − 3(1 − r1 )][λ − 2(1 − r2 )] − (1 − r1 )(1 − r2 )
= λ2 − (5 − 3r1 − 2r2 )λ + 5(1 − r1 )(1 − r2 )
Example 3. This last example is interesting in that it is not all clear how to
express the problem in terms of matrices. Suppose we construct the following
sequence of numbers: let a0 = 0 and a1 = 1, a2 = 12 , a3 = (1 + 12 )/2 = 43 ,
a4 = ( 21 + 43 )/2 = 58 , and in general an+2 = (an + an+1 )/2; that is, an+2 is
the average of the two preceding numbers. Do these numbers approach some
constant as n gets larger and larger?
5.3. APPLICATIONS 195
T
Solution. Let Un = an−1 an for n = 1, 2, . . . . Recalling how the an are
defined, we have
an
an
Un+1 = = an+1 an
an+1 +
2 2
0 1
an−1
= 1 1 = AUn (5.15)
an
2 2
where A is the 2 × 2 matrix appearing in (5.15). As in similar examples, we
T
have Un = An−1 U1 , where U1 equals 0 1 . The characteristic polynomial of
A is
−λ 1
λ1 = 1, f 1 = (1, 1)
1
λ = − , f 2 = (−2, 1)
2
1 −2 1 2
Setting P equal to , we calculate P −1 = 31 . Thus, we
1 1 −1 1
have
1 2
1 0
1 −2 n 3 3
An =
1
1 1 0 − 1 1
2 −
3 3
Clearly, the larger n gets
the closer the middle matrix on the right-hand side
1 0
gets to the matrix which means that the vector Un = An−1 U1 gets close
0 0
to the vector U∞ , where
1 2
1 −2 1 0 3 3 0
U∞ =
1 1 0 0 1 1 1
−
3 3
1 2 2
3 3 0 3
= =
1 2 1 2
3 3 3
T 2 2 T
Thus, Un = an−1 an gets close to 3 3 , which means that the numbers
an get close to the number 32 . Notice too that U∞ is an eigenvector of A
196 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
a0 = 1, a1 = 1, a2 = 2, a3 = 3, . . . , an+1 = an + an−1
Find a general formula for an and determine its limit if one exists.
3. Define the following sequence of numbers:
1 2 1 2
a0 = a a1 = b a2 = a0 + a1 and an+2 = an + an+1
3 3 3 3
Find a general formula for an and determine its limiting behavior.
4. Define the following sequence of numbers:
6. We again have the same two cities as in problem 5, only this time let’s
assume that r1 represents the fraction of the first city’s population that
moves to the second city and r2 the fraction of the second city’s population
that moves to the first city.
a. Find formulas for xk+1 and yk+1 .
b. What is the limiting population of each city?
7. Suppose that the populations of two species change from one year to the
next according to the equations
Are there any initial populations and harvesting schemes that leave the
populations constant from one year to the next?
8. Let u0 = a > 0, u1 = b > 0. Define u2 = u0 u1 , un+2 = un un+1 . What is
lim un ? Hint: log un =?
Supplementary Problems
1. Define and give examples of the following:
a. Eigenvector and eigenvalue
b. Eigenspace
c. Characteristic matrix and polynomial
2. Find a linear transformation from R3 to R3 such that 4 is an eigenvalue
of multiplicity 2 and 5 is an eigenvalue of multiplicity 1.
3. Let L be a linear transformation from V to V . Let λ be any eigenvalue of
L, and let Kλ be the eigenspace of λ. That is, Kλ = {xx : Lx
x = λx
x }. Show
that Kλ is a subspace of V , and that it is invariant under L, cf. number
13 in the Supplementary Problems for Chapter 3.
4. Let λ be an eigenvector of L. A nonzero vector x is said to be a generalized
eigenvector of L, if there is a positive integer k such that
[L − λI]k x = 0
Show that the set of all generalized eigenvectors of λ along with the zero
vector is an invariant subspace.
5. Let
2 1 0
A = 0 2 1
0 0 2
198 CHAPTER 5. EIGENVALUES AND EIGENVECTORS
6. Let
1 2 −4
A= 0 −1 6
0 −1 4
x, x } is linearly independent.
a. Show that F = {Lx
b. Show that the matrix
representation
A of L with respect to the basis
0 1
F of part a equals .
0 0
c. Deduce that L2y = 0 for every vector y in R2 .
9. Let
1 0 0 2
0 1 3 0
A=
0
0 2 0
0 0 0 2
Find a matrix P such that P −1 AP is a diagonal matrix. Compute A10 .
P2
11. Let V = { n=0 an cos nt : an any real number}.
a. Show that {1, cos t, cos 2t} is a basis of V .
P2 P2
b. Define L : V → V by L( n=0 an cos nt) = n=0 −n2 an cos nt. Find
the eigenvalues and eigenvectors of L. Note that L[f ] = −f ′′ .
c. Find L1/2 , i.e., find a linear transformation T such that T 2 = L.
d. For any positive integers p and q find Lp/q .
12. Consider the following set of directions. Start at the origin, facing toward
the positive x1 axis, turn 45 degrees counterclockwise,
√ and walk 1 unit in
√
this direction. Thus, you will be at the point (1/ 2, 1/ 2). Then turn 45
degrees counterclockwise and walk 12 unit in this new direction; then turn
45 degrees counterclockwise and walk 14 unit in this direction. If a cycle
consists of a 45-degree counterclockwise rotation plus a walk that is half
as long as the previous walk, where will you be after 20 cycles?
13. Let M be the set of 2 × 2 matrices A for which
1 1
A =λ
0 0