Chapter 7: Eigenvalues and Eigenvectors
SECTION A Introduction to Eigenvalues and Eigenvectors
By the end of this section you will be able to
· understand what is meant by an eigenvalue and eigenvector
· determine eigenvalues and eigenvectors
· prove properties of eigenvalues and eigenvectors
This is an important chapter in linear algebra because eigenvalues and eigenvectors have many applications in the physical sciences and engineering. This section is straightforward, but it does rely on a number of topics in linear algebra such as matrices, determinants and vectors.
You need to thoroughly know how to evaluate determinants to understand this chapter.
A1 Definition of Eigenvalue and Eigenvector
Before we define what is meant by an eigenvalue and eigenvector we do an example which involves these.
Example 1
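Let $A = \begin{pmatrix} 4 & -2 \\ 1 & 1 \end{pmatrix}$ and $u = \begin{pmatrix} 2 \\ 1 \end{pmatrix}$, then evaluate $Au$.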
Solution.
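Multiplying the matrix and column vector we have
$Au = \begin{pmatrix} 4 & -2 \\ 1 & 1 \end{pmatrix} \begin{pmatrix} 2 \\ 1 \end{pmatrix} = \begin{pmatrix} 6 \\ 3 \end{pmatrix}$
where each entry is the dot product of a row of the matrix with the vector:
(4, -2) . (2, 1) = (4 x 2) + (-2 x 1) = 6
(1, 1) . (2, 1) = (1 x 2) + (1 x 1) = 3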
What do you notice about your result?
Well, we have $Au = \begin{pmatrix} 6 \\ 3 \end{pmatrix} = 3 \begin{pmatrix} 2 \\ 1 \end{pmatrix} = 3u$. In geometric terms, the matrix A scalar multiplies the vector u by 3:
Fig 1
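The arithmetic of Example 1 can also be checked on a computer. The following is a minimal sketch in plain Python; the helper name mat_vec is our own choice for illustration and does not come from any library.

```python
def mat_vec(matrix, vector):
    """Multiply a matrix (a list of rows) by a column vector (a list of numbers)."""
    # Each entry of the product is the dot product of one row with the vector.
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]


A = [[4, -2],
     [1, 1]]
u = [2, 1]

Au = mat_vec(A, u)
print(Au)                    # [6, 3]
print([3 * x for x in u])    # [6, 3], the same vector, so Au = 3u
```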
In general terms this can be written as
$Au = \lambda u$    (7.1)
where A is a square matrix, u is a column vector and $\lambda$ (lambda) is a scalar. Can you think of a vector, u, which satisfies equation (7.1)?
The zero vector, $u = O$. In this case we say we have the trivial solution. In this chapter we consider the non-trivial solutions, $u \neq O$, and these solutions are powerful tools in linear algebra.
For a non-zero vector u, the scalar $\lambda$ is called an eigenvalue of the matrix A and the vector u is called an eigenvector belonging to, or corresponding to, $\lambda$ if they satisfy $Au = \lambda u$. In most linear algebra literature the Greek letter lambda, $\lambda$, is used for eigenvalues.
The terms eigenvalue and eigenvector are derived from the German word ‘Eigenwert’, which means ‘proper value’. The word eigen is pronounced “i-gun”.
Eigenvalues were initially developed in the field of differential equations by Jean d’Alembert.
Fig 2 Alembert 1717 to 1783
Alembert’s father died when he was only nine years old, but his father’s family looked after his financial situation so that he could be educated.
In 1735 Alembert graduated and thought a career in law would suit him, but his real thirst and enthusiasm was for mathematics, which he pursued in his spare time. Three years later he qualified as an advocate, but he did not pursue a career in this field and chose mathematics instead.
For most of his life he worked for the Paris Academy of Science and the French Academy.
He was well known to have a short fuse and loved to argue with most of his contemporaries.
Example 2
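Let $A = \begin{pmatrix} 1 & 1 \\ -2 & 4 \end{pmatrix}$. Verify the following:
(a) $u = \begin{pmatrix} 1 \\ 1 \end{pmatrix}$ is an eigenvector of the matrix A corresponding to the eigenvalue $\lambda = 2$.
(b) $v = \begin{pmatrix} 1 \\ 2 \end{pmatrix}$ is an eigenvector of the matrix A corresponding to the eigenvalue $\lambda = 3$.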
Solution.
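(a) Multiplying the given matrix A and column vector u we have
$Au = \begin{pmatrix} 1 & 1 \\ -2 & 4 \end{pmatrix} \begin{pmatrix} 1 \\ 1 \end{pmatrix} = \begin{pmatrix} 2 \\ 2 \end{pmatrix} = 2 \begin{pmatrix} 1 \\ 1 \end{pmatrix} = 2u$
where each entry is the dot product of a row of the matrix with the vector:
(1, 1) . (1, 1) = (1 x 1) + (1 x 1) = 2
(-2, 4) . (1, 1) = (-2 x 1) + (4 x 1) = 2
Thus $u = \begin{pmatrix} 1 \\ 1 \end{pmatrix}$ is an eigenvector of the matrix A belonging to $\lambda = 2$ because $Au = 2u$.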
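(b) Similarly we have
$Av = \begin{pmatrix} 1 & 1 \\ -2 & 4 \end{pmatrix} \begin{pmatrix} 1 \\ 2 \end{pmatrix} = \begin{pmatrix} 3 \\ 6 \end{pmatrix} = 3 \begin{pmatrix} 1 \\ 2 \end{pmatrix} = 3v$
where each entry is the dot product of a row of the matrix with the vector:
(1, 1) . (1, 2) = (1 x 1) + (1 x 2) = 3
(-2, 4) . (1, 2) = (-2 x 1) + (4 x 2) = 6
Thus $v = \begin{pmatrix} 1 \\ 2 \end{pmatrix}$ is an eigenvector of the matrix A belonging to $\lambda = 3$ because $Av = 3v$.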
What do you notice about your results?
A 2 by 2 matrix can have more than one eigenvalue and eigenvector.
Note that we can have eigenvalues and eigenvectors for a square matrix of any size n by n, and in every case they satisfy $Au = \lambda u$.
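The verification carried out in Example 2 can be sketched as a small Python function. The name is_eigenpair is hypothetical, chosen here for illustration, and the comparison is exact only because the entries are whole numbers.

```python
def is_eigenpair(matrix, vector, lam):
    """Return True if vector is a non-zero vector with matrix * vector = lam * vector."""
    if all(x == 0 for x in vector):
        return False  # the zero vector is never an eigenvector
    product = [sum(a * x for a, x in zip(row, vector)) for row in matrix]
    return product == [lam * x for x in vector]


A = [[1, 1],
     [-2, 4]]

print(is_eigenpair(A, [1, 1], 2))   # True,  part (a) of Example 2
print(is_eigenpair(A, [1, 2], 3))   # True,  part (b) of Example 2
print(is_eigenpair(A, [1, 1], 3))   # False, the wrong scalar for this vector
```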
Example 3
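Let $A = \begin{pmatrix} 5 & 0 & 0 \\ -9 & 4 & -1 \\ -6 & 2 & 1 \end{pmatrix}$ and $u = \begin{pmatrix} 0 \\ 1 \\ 2 \end{pmatrix}$. Determine an eigenvalue of this matrix A.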
Solution.
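Multiplying the matrix and column vector we have
$Au = \begin{pmatrix} 5 & 0 & 0 \\ -9 & 4 & -1 \\ -6 & 2 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \\ 2 \end{pmatrix} = \begin{pmatrix} 0 \\ 2 \\ 4 \end{pmatrix} = 2 \begin{pmatrix} 0 \\ 1 \\ 2 \end{pmatrix}$
where each entry is the dot product of a row of the matrix with the vector:
(5, 0, 0) . (0, 1, 2) = (5 x 0) + (0 x 1) + (0 x 2) = 0
(-9, 4, -1) . (0, 1, 2) = (-9 x 0) + (4 x 1) + (-1 x 2) = 2
(-6, 2, 1) . (0, 1, 2) = (-6 x 0) + (2 x 1) + (1 x 2) = 4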
We have $Au = \lambda u$ where $\lambda = 2$. Hence $\lambda = 2$ is an eigenvalue of the matrix A with an eigenvector u.
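The reasoning of Example 3, multiply and then read off the scalar, can be sketched in Python as follows. The function name eigenvalue_of is our own for illustration, and the sketch assumes whole number entries so that the arithmetic with Fraction is exact.

```python
from fractions import Fraction


def eigenvalue_of(matrix, vector):
    """Return lam with matrix * vector = lam * vector, or None if there is none."""
    product = [sum(a * x for a, x in zip(row, vector)) for row in matrix]
    # Use the first non-zero entry of the vector to read off the candidate scalar.
    index = next((i for i, x in enumerate(vector) if x != 0), None)
    if index is None:
        return None  # the zero vector is not an eigenvector
    lam = Fraction(product[index], vector[index])
    # The same scalar must work for every entry.
    if all(p == lam * x for p, x in zip(product, vector)):
        return lam
    return None


A = [[5, 0, 0],
     [-9, 4, -1],
     [-6, 2, 1]]

print(eigenvalue_of(A, [0, 1, 2]))   # 2
print(eigenvalue_of(A, [1, 0, 0]))   # None, this vector is not an eigenvector of A
```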
A2 Characteristic Equation
From the above equation (7.1) we have
$Au = \lambda u = \lambda I_n u$    [Multiplying by the identity keeps it the same]
where $I_n$ is the n by n identity matrix. We can rewrite this as
$Au - \lambda I_n u = (A - \lambda I_n) u = O$
Under what condition is the non-zero vector u a solution of this equation?
By Proposition (2.30) of chapter 2, this equation has a non-zero solution u if and only if
$\det(A - \lambda I_n) = 0$
This is an important equation and is called the characteristic equation. We give this a reference number
$\det(A - \lambda I) = 0$    (7.2)
We drop the subscript n because we assume the identity matrix I is of appropriate size.
In some linear algebra texts you will see (7.2) written as $\det(\lambda I - A) = 0$. Both of these are equivalent so it does not matter which we apply to find the eigenvalues.
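This second approach does have the advantage of not expanding brackets like $(a - \lambda)(b - \lambda)$, where a and b stand for entries on the leading diagonal of A.
But if you take out the 2 minus signs then it should be easier to expand this: $(a - \lambda)(b - \lambda) = (\lambda - a)(\lambda - b)$.
The advantage of using (7.2) is that you don’t need to subtract the matrix A; it just boils down to subtracting $\lambda$ along the leading diagonal.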
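To see why the two forms of the characteristic equation give the same eigenvalues, here is a short worked check. It uses the matrix of Example 2, and the factor $(-1)^n$ is the standard effect on a determinant of multiplying every one of the n rows by $-1$.

```latex
\[
\lambda I - A = -(A - \lambda I)
\quad\Longrightarrow\quad
\det(\lambda I - A) = (-1)^n \det(A - \lambda I)
\]
% Check with the 2 by 2 matrix A of Example 2, so n = 2:
\[
\det(A - \lambda I)
= \begin{vmatrix} 1-\lambda & 1 \\ -2 & 4-\lambda \end{vmatrix}
= (1-\lambda)(4-\lambda) + 2
= \lambda^2 - 5\lambda + 6
\]
\[
\det(\lambda I - A)
= \begin{vmatrix} \lambda-1 & -1 \\ 2 & \lambda-4 \end{vmatrix}
= (\lambda-1)(\lambda-4) + 2
= \lambda^2 - 5\lambda + 6
\]
```

Both determinants equal $(\lambda - 2)(\lambda - 3)$, so either form gives $\lambda = 2$ and $\lambda = 3$, in agreement with Example 2.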
We use this characteristic equation to find the eigenvalues and corresponding eigenvectors. The procedure for this is as follows:
1. Solve the characteristic equation $\det(A - \lambda I) = 0$ for the scalar $\lambda$.
2. For the eigenvalue $\lambda$ determine the corresponding eigenvector u by solving the system $(A - \lambda I) u = O$.
Let’s follow this procedure for the next example.
Note that eigenvalues and eigenvectors come in pairs. You cannot have one without the other.
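The two-step procedure can be sketched in Python for the special case of a 2 by 2 matrix. This is an illustration under stated assumptions rather than a general method: the function name eigenpairs_2x2 is hypothetical, the eigenvalues are assumed to be real, and only one representative eigenvector is returned for each eigenvalue.

```python
import math


def eigenpairs_2x2(matrix):
    """Return [(eigenvalue, eigenvector), ...] for a real 2 by 2 matrix.

    Assumes the eigenvalues are real and raises ValueError otherwise.
    """
    (a, b), (c, d) = matrix

    # Step 1: det(A - lam I) = lam^2 - (a + d) lam + (ad - bc) = 0
    trace = a + d
    determinant = a * d - b * c
    discriminant = trace * trace - 4 * determinant
    if discriminant < 0:
        raise ValueError("eigenvalues are not real")
    root = math.sqrt(discriminant)
    eigenvalues = [(trace - root) / 2, (trace + root) / 2]

    # Step 2: choose one non-zero solution u of (A - lam I) u = O
    pairs = []
    for lam in eigenvalues:
        if b != 0:
            u = (b, lam - a)      # satisfies the top row (a - lam) x + b y = 0
        elif c != 0:
            u = (lam - d, c)      # satisfies the bottom row c x + (d - lam) y = 0
        elif lam == a:
            u = (1, 0)            # diagonal matrix, eigenvalue in the first entry
        else:
            u = (0, 1)            # diagonal matrix, eigenvalue in the second entry
        pairs.append((lam, u))
    return pairs


for pair in eigenpairs_2x2([[1, 1], [-2, 4]]):
    print(pair)
# (2.0, (1, 1.0))
# (3.0, (1, 2.0))
```

Applied to the matrix of Example 2 this reproduces the eigenvalues 2 and 3 with the eigenvectors verified there. Any non-zero scalar multiple of each returned vector would serve equally well.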
Example 4
Determine the eigenvalues and corresponding eigenvectors of .
Also sketch the effect of multiplying the eigenvectors by the matrix A.
Solution.
What do we find first, the eigenvalues or eigenvectors?
Eigenvalues because eigenvectors are born out of eigenvalues. Remember we carry out the procedure outlined above:
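Step 1. Solve the characteristic equation $\det(A - \lambda I) = 0$ for the scalar $\lambda$.
We need to find $\lambda$ in $\det(A - \lambda I) = 0$. First we obtain $A - \lambda I$:
Substituting this into $\det(A - \lambda I)$ gives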
For eigenvalues we equate this determinant to zero:
Step 2. For the eigenvalue $\lambda$ determine the corresponding eigenvector u by solving the system $(A - \lambda I) u = O$.
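Let u be the eigenvector corresponding to the first eigenvalue. Substituting A and this eigenvalue into $(A - \lambda I) u = O$ gives
$\begin{pmatrix} 0 & 0 \\ 1 & 1 \end{pmatrix} u = O$
Remember $O = \begin{pmatrix} 0 \\ 0 \end{pmatrix}$ and let $u = \begin{pmatrix} x \\ y \end{pmatrix}$, and so we have
$\begin{pmatrix} 0 & 0 \\ 1 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}$
Multiplying out gives
$0 = 0$
$x + y = 0$
where each left-hand side is the dot product of a row of the matrix with the vector:
(0, 0) . (x, y) = (0)(x) + (0)(y) = 0
(1, 1) . (x, y) = (1)(x) + (1)(y) = x + y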
Remember the eigenvector cannot be the zero vector, therefore at least one of the values, x or y, must be non-zero. From the bottom equation we have $x + y = 0$, that is $y = -x$.
The simplest solution is $x = 1$, $y = -1$, but we can have $x = 2$, $y = -2$ etc.
We have an infinite number of solutions. We need to write down the general eigenvector u. How?
Let $x = s$, then $y = -s$ where $s \neq 0$ is a scalar. Thus the eigenvectors belonging to this eigenvalue are
$u = s \begin{pmatrix} 1 \\ -1 \end{pmatrix}$ where $s \neq 0$
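Similarly we find the general eigenvector v corresponding to the other eigenvalue. Putting this eigenvalue into $(A - \lambda I) v = O$ gives
$\begin{pmatrix} -1 & 0 \\ 1 & 0 \end{pmatrix} v = O$
By writing $v = \begin{pmatrix} x \\ y \end{pmatrix}$ [different x and y from those above] and $O = \begin{pmatrix} 0 \\ 0 \end{pmatrix}$ we obtain
$\begin{pmatrix} -1 & 0 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}$
Multiplying out:
$-x = 0$
$x = 0$
From these equations we must have $x = 0$. What is y equal to?
We can choose y to be any real number apart from zero. Thus
$y = s$ where $s \neq 0$
The general eigenvector belonging to this eigenvalue is
$v = s \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ where $s \neq 0$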
Summarizing the above we have for the eigenvalue the eigenvector
and for the eigenvector . What does all this mean?
It means that we must have and which in this case is
and
Plotting these eigenvectors and the effect of multiplying by the matrix A is shown:
The effect of multiplying the eigenvectors by the matrix A is to produce scalar multiples of themselves, as you can see in Fig 3.
Eigenvectors are non-zero vectors which are transformed to scalar multiples of themselves by a square matrix A.
Next we find the eigenvalues and eigenvectors of a 3 by 3 matrix. Follow the algebra carefully because you will have to expand brackets like
To expand this it is easier to take out 2 minus signs and then expand, that is
Example 5
Determine the eigenvalues of
Solution.
We have
It is easier to remember that $A - \lambda I$ is actually the matrix A with $\lambda$ subtracted along the leading diagonal (from top left to bottom right). We need to evaluate $\det(A - \lambda I)$.
What is the easiest way to find this?
From the properties of determinants of chapter 2 we know that it will be easier to evaluate the determinant along the second row, containing the elements 0, and 0. Why?
Because it has two zeros so that we do not have to evaluate the 2 by 2 determinants associated with these zeros. From above we have
By the characteristic equation (7.2), which is $\det(A - \lambda I) = 0$, we equate all the above to zero:
Solving this gives the eigenvalues .
Example 6
Determine the eigenvectors associated with $\lambda = 3$ for the matrix A given in Example 5 above.
Solution.
Substituting the eigenvalue $\lambda = 3$ and the matrix A into $(A - \lambda I) u = O$ (subtract 3 from the leading diagonal) gives:
where u is the eigenvector corresponding to $\lambda = 3$.
What is the zero vector, O, equal to?
Remember this zero vector is $O = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}$ and also let $u = \begin{pmatrix} x \\ y \\ z \end{pmatrix}$. Substituting these into the above and simplifying gives
Expanding this yields the linear system
From the middle equation we have . From the top equation we have
which gives
If then ; or more generally if then where [Not Zero].
The general eigenvector where and corresponds to .
The eigenvectors corresponding to and are part of Exercise 7a.
A3 Eigenspace
Note that for the eigenvalue $\lambda = 3$ in Example 6 above we have an infinite number of eigenvectors, obtained by substituting various non-zero values of s:
etc
These solutions are given by all the points (apart from the origin) on the line shown below:
In general, if A is a square matrix and $\lambda$ is an eigenvalue of A with an eigenvector u, then every scalar multiple (apart from 0) of the vector u is also an eigenvector belonging to the eigenvalue $\lambda$. We write this as a proposition:
Proposition (7.3). If $\lambda$ is an eigenvalue of a square matrix A with an eigenvector u then every non-zero scalar multiple of u is also an eigenvector belonging to $\lambda$.
Proof.
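By (7.1) we have $Au = \lambda u$ where $\lambda$ is the eigenvalue and u is the eigenvector belonging to $\lambda$. Consider an arbitrary non-zero scalar k, then
$A(ku) = k(Au) = k(\lambda u) = \lambda(ku)$
Thus we have $A(ku) = \lambda(ku)$, and $ku \neq O$ because $k \neq 0$ and $u \neq O$, which means that $ku$ is an eigenvector belonging to the eigenvalue $\lambda$. Since k was arbitrary, every non-zero scalar multiple of u is an eigenvector of the matrix A belonging to the eigenvalue $\lambda$.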
■
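Proposition (7.3) can be illustrated numerically with the matrix and eigenvector of Example 3. This is only a spot check for a handful of scalars, not a proof, and the list of scalars is an arbitrary choice made for this sketch.

```python
from fractions import Fraction


def mat_vec(matrix, vector):
    """Multiply a matrix (a list of rows) by a column vector (a list of numbers)."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]


A = [[5, 0, 0],
     [-9, 4, -1],
     [-6, 2, 1]]
u = [0, 1, 2]      # eigenvector from Example 3
lam = 2            # its eigenvalue

scalars = [-3, Fraction(1, 2), 1, 7]   # arbitrary non-zero values of k

checks = []
for k in scalars:
    ku = [k * x for x in u]
    # A(ku) should equal lam * (ku) for every non-zero k
    checks.append(mat_vec(A, ku) == [lam * x for x in ku])

print(checks)   # [True, True, True, True]
```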
Proposition (7.4). If A is an n by n matrix with an eigenvalue $\lambda$, then the set of all eigenvectors of A belonging to $\lambda$ together with the zero vector, O, that is the following set
$S = \{ u : Au = \lambda u \}$
is a subspace of $\mathbb{R}^n$.
How do we prove the given set is a subspace of $\mathbb{R}^n$?
We can use Proposition (4.7) of chapter 4 which says that any non-empty subset S of a vector space is a subspace if and only if
(a) The zero vector, O, is in S
(b) If vectors u and v are in S then for any scalars k and c we have $ku + cv$ is also in S.
Proof.
What do we need to prove?
We need to prove the zero vector, O, is in the set S and if u and v are eigenvectors belonging to the eigenvalue $\lambda$ then $ku + cv$ is also in S, that is $A(ku + cv) = \lambda(ku + cv)$.
Clearly by the definition of the set S we have the zero vector in S.
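Let u and v be eigenvectors belonging to the eigenvalue $\lambda$ and k and c be any non-zero scalars. Then by the above Proposition (7.3) we have
$A(ku) = \lambda(ku)$ and $A(cv) = \lambda(cv)$
We need to show that $A(ku + cv) = \lambda(ku + cv)$:
$A(ku + cv) = A(ku) + A(cv) = \lambda(ku) + \lambda(cv) = \lambda(ku + cv)$
Thus $ku + cv$ satisfies (7.1) with the eigenvalue $\lambda$, which means it is a member of the set S. The same calculation holds when k or c is zero because $A(O) = O$. By Proposition (4.7) we conclude that the set S is a subspace of $\mathbb{R}^n$.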
■
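The closure property used in this proof can be spot checked in Python. The matrix B below is a hypothetical example chosen for this sketch (it does not appear elsewhere in the chapter) because its eigenvalue 2 has two independent eigenvectors, which makes the linear combinations more interesting than multiples of a single vector.

```python
def mat_vec(matrix, vector):
    """Multiply a matrix (a list of rows) by a column vector (a list of numbers)."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]


def in_eigenspace(matrix, vector, lam):
    """Return True if matrix * vector = lam * vector (the zero vector is allowed)."""
    return mat_vec(matrix, vector) == [lam * x for x in vector]


# Hypothetical diagonal matrix with eigenvalue 2 repeated on the diagonal.
B = [[2, 0, 0],
     [0, 2, 0],
     [0, 0, 5]]
lam = 2
u = [1, 0, 0]   # B u = 2 u
v = [0, 1, 0]   # B v = 2 v

# Test k u + c v for a small grid of scalars, including zero.
results = []
for k in range(-2, 3):
    for c in range(-2, 3):
        combination = [k * x + c * y for x, y in zip(u, v)]
        results.append(in_eigenspace(B, combination, lam))

print(all(results))                      # True
print(in_eigenspace(B, [0, 0, 1], lam))  # False, this vector belongs to eigenvalue 5
```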
This subspace S of Proposition (7.4)
$S = \{ u : Au = \lambda u \}$
is called an eigenspace of $\lambda$ and is denoted by $E_\lambda$, that is $E_\lambda = \{ u : Au = \lambda u \}$.
For example the eigenspace relating to Example 4 for the eigenvalue is the eigenvector and for the eigenvector is shown below:
Note that the vector is a basis for the eigenspace and is a basis for the eigenspace . The simplest eigenvectors are a basis for each eigenspace.
The line in Fig 4 with arrows pointing to it is the eigenspace of Example 5.
We can also use the numerical software MATLAB to find eigenvalues and eigenvectors as the following commands verify:
Let and enter this matrix in MATLAB. The command eig(A) gives you the eigenvalues of this matrix which is . The following command
> [V,d]=eig(A) should show