Mathematics 270 Linear Algebra I

Study Guide :: Unit 2

Inverse of a Matrix, Linear Systems and Special Forms of Matrices

In the previous unit we saw that matrix arithmetics has analogous properties to the arithmetics of numbers. The same occurs with the “inverse of a matrix.” As we know, if a real number a is different from zero, then there is an “inverse of a ,” denoted as a 1 , which is calculated as a 1 = 1 / a and has the property that a . a 1 = 1 . The first part of this unit is devoted to finding the analogous case for matrices. In other words, if we are given a matrix A , we want to find the conditions on the matrix A and a methodology such that we can find its inverse, A 1 . Thus, after reading the first two sections of this unit you will be able, when possible, to calculate the inverse of a matrix A 1 and understand that its definition implies that A 1 A = I , where I is the identity matrix.

You may be asking yourself, why do we want to define or study the “inverse,” A 1 , of a matrix A ?

The answer to this question is provided in the second part of this unit, where the focus is to explain how the inverse of a matrix is used for the purpose of obtaining information about—or even solving—a system of linear equations.

The focus of this unit, therefore, is the study of the inverse of a matrix, its properties and its relation to systems of linear equations.

Objectives

After completing Unit 2, you should be able to:

  1. determine whether or not a matrix is invertible;
  2. know and use the properties of the transpose and inverse matrices;
  3. prove basic properties involving invertible matrices;
  4. understand the concept of elementary matrices;
  5. use row operations to reduce a square matrix to the identity matrix;
  6. understand the relation between the inverse of a matrix and linear systems;
  7. understand the Equivalent Statements Theorem;
  8. find the inverse of a matrix using elementary matrices and row operations;
  9. solve linear systems using the inverse of the coefficient matrix;
  10. determine consistency of a linear system and determine whether it has one solution, infinitely many solutions or no solutions;
  11. understand what diagonal, triangular and symmetric matrices are; and
  12. construct and analyze a Leontief Input-Output model given the information about an economy.

2.1  Inverse of a Matrix and Its Properties

This section introduces the definition of the inverse of a matrix and some of its algebraic properties. In particular, given a square matrix A of size n × n , if there exists a matrix B such that A B = B A = I , where I denotes the identity matrix, then we say that A is “invertible” and its “inverse” is the matrix B , which is denoted as A 1 .

From this definition there are three aspects worth noticing. The first one is that the definition of an inverse only refers to square matrices. The second aspect is that even if a matrix is square, there is no guarantee that the inverse of that matrix exists. Finally, the identity matrix I plays the analogous role that the number 1 plays in the real numbers (since A A 1 = I ). This role was already pointed out in the last section of Unit 1, where it was noted that the identity matrix, I , can be seen as analogous to number 1 in the sense that the multiplication of any matrix A times the identity I results in the matrix A .

Reading Assignment

Read, and study, pages A1–A4 (“Appendix A” after p. 713), and pages 43–49 of the textbook (to “Exercise Set 1.4”).

Since the properties of the inverse of a matrix are stated as theorems and properly justified by proofs, the first part of the reading in “Appendix A” of the textbook aims at helping you understand and becoming familiar with the basic structure of theorems. Keep in mind that the information given in this appendix will be useful throughout the whole course.

The reading also introduces you to the conditions under which a matrix of size 2 × 2 is invertible and offers you a simple formula for finding its inverse. Specifically, Anton and Rorres present the following theorem (p. 45):

Theorem 1.  The matrix

A = ( a b c d )

is invertible if and only if a d b c 0 , in which case the inverse is given by

A 1 = 1 a d b c ( d b c a ) . (2.1)

Notice that the only condition for the inverse of a 2 × 2 matrix A to exist is that a d b c 0 . The quantity a d b c is called the determinant of A and is denoted as

det ( A ) = | a b c d | = a d b c . (2.2)

Thus, Theorem 1 can be restated as

Theorem 2.  The matrix

A = ( a b c d )

is invertible if and only if det ( A ) 0 , in which case the inverse is given by

A 1 = 1 det ( A ) ( d b c a ) . (2.3)

Historically, the Equation (2.3) for the inverse of a matrix was introduced in the second part of the nineteenth century by the British mathematician Arthur Cayley, and even though Theorem 2 and Equation (2.3) are presented with no proof in this section, you will use them to calculate the inverse of 2 × 2 matrices and to solve systems of two linear equations with two unknowns.

Specifically, you will see in “Example 8” of the textbook (p. 45) that if the determinant of a coefficient matrix A of a system of two linear equations with two unknowns is not equal to zero (i.e., A is invertible according to Theorem 2), then the system of equations has a unique solution. This is the reason for the name “determinant” of A , since it is this quantity, det ( A ) , that “determines” whether or not the system has a unique solution.

Exercises

Work on the following textbook exercises from “Exercise Set 1.4” (pp. 49–51):

  • 3, 5, 7, 11, 13, 15, 16, 17, 19, 21, 25, 27, 31, 33(a); and
  • “True-False Exercises” (a, b, d, e, f).

2.2  A Method for Finding the Inverse A 1

The main objective of this section is to provide a step-by-step method for finding the inverse of a given square matrix A of any size. This methodology can be summarized in the following theorem:

Theorem 3.  If a square matrix A of size n × n can be reduced to the identity matrix I n using elementary row operations (i.e., the reduced row echelon form of A is I n ), then the inverse A 1 can be found by applying the same row operations to the identity matrix I n .

To illustrate this methodology, let’s suppose we want to find the inverse of the matrix

A = ( 3 0 2 1 ) (2.4)

We know that if we multiply the first row of A by 1 / 3 (row operation R 1 ) and then multiply the first row of the resulting matrix by 2 and add it up to the second row (row operation R 2 ), we obtain the identity matrix I 2 as the reduced row echelon form of A . In other words,

A = ( 3 0 2 1 ) R 1 ( 1 0 2 1 ) R 2 ( 1 0 0 1 ) = I 2 . (2.5)

According to Theorem 3, we can find the inverse A 1 by applying these same row operations ( R 1 and R 2 ) to the identity matrix I 2 . Thus,

I 2 = ( 1 0 0 1 ) R 1 ( 1 / 3 0 0 1 ) R 2 ( 1 / 3 0 2 / 3 1 ) = A 1 . (2.6)

You can easily verify in this example that A A 1 = I 2 .

Notice that we apply the same elementary row operations to both the matrix A and the identity matrix I 2 . Thus, in order to apply these operations at the same time, the results of the procedure for finding the inverse A 1 are typically presented in the following form

( A | I 2 ) = ( 3 0 2 1 | 1 0 0 1 ) R 1 ( 1 0 2 1 | 1 / 3 0 0 1 )

R 2 ( 1 0 0 1 | 1 / 3 0 2 / 3 1 ) = ( I 2 | A 1 ) .

Elementary Matrices and Equivalent Statements Theorem

In order to provide a formal proof of the step-by-step methodology for finding the inverse of a matrix presented in Theorem 3, it is necessary to introduce the concept of an “elementary matrix,” which is defined as a matrix that is obtained from applying a single row operation to an identity matrix. For example, the matrix

E = ( 1 / 3 0 0 1 ) (2.7)

in Equation (2.6) is an elementary matrix obtained by multiplying the first row of the identity matrix by 1 / 3 .

Now, with the definition of an elementary matrix, we are able to establish an equivalent relationship between applying a row operation on a given matrix A and multiplying the corresponding elementary matrix (of that row operation) to the matrix A . To illustrate this, consider, for example, the matrix A given by Equation (2.4). By multiplying its first row by 1 / 3 (operation R 1 ) , we obtain a matrix B , i.e.,

A = ( 3 0 2 1 ) R 1 B = ( 1 0 2 1 ) . (2.8)

Notice now that this matrix B can also be obtained by multiplying the matrix A by the elementary matrix E given by Equation (2.7) (obtained by multiplying the first row of the identity matrix by 1 / 3 ), i.e.,

E A = ( 1 / 3 0 0 1 ) ( 3 0 2 1 ) = ( 1 0 2 1 ) = B . (2.9)

With this equivalent relationship in mind, you will be able to understand the proof of the following theorem presented in the textbook (p. 53):

Theorem 4.  Equivalent Statements

If A is an n × n matrix, then the following statements are equivalent (they are all true or all false):

  1. A is invertible.
  2. A x = 0 has only the trivial solution.
  3. The reduced row echelon form of A is I n .
  4. A can be expressed as a product of elementary matrices.

After completing the assigned reading you will see that the proof of this theorem does actually contains the methodology for finding the inverse of a matrix explained at the beginning of this section. In other words, it contains the proof of Theorem 3.

The importance of Theorem 4 stems from the fact that it groups together many of the linear algebra topics studied in this course. Until now, it is clear that it associates results about the invertibility of matrices, homogenous linear systems, reduced row echelon forms, and elementary matrices. We will be able to keep adding more statements to it as we advance through the course. We will, for example, be able to add results related to nonhomogeneous linear systems and determinants of matrices. Thus, it is very important to keep this theorem is mind throughout the course. It will serve as an excellent skeleton on which to put the muscle of our linear algebra knowledge.

Reading Assignment

Read, and study, pages 52–58 of the textbook (to “Exercise Set 1.5”). This reading provides all the necessary details to understand and prove the Equivalent Statements Theorem 4 and, therefore, the methodology, given by Theorem 3, for finding the inverse of a matrix. You will have the opportunity to see, with examples, how the results are applied.

Exercises

Work on the following textbook exercises from “Exercise Set 1.5” (pp. 58–60):

  • 1(a, b, d), 3, 5(a, b), 7(a, d), 9, 11, 15, 17, 19, 20, 21, 23, 25, 27, 33, 35, 37; and
  • “True-False Exercises” (a, b, c, d, f).

2.3  Linear Systems and the Inverse of a Matrix

Now that we have some knowledge of the inverse of a matrix and how to calculate it, we are able to come back to the question posed at the introduction of this unit: why do we want to define or study the “inverse,” A 1 , of a matrix A ?

The main reason is that its definition, its existence (or non-existence) and even its properties provide us with useful information about the solution of a system of linear equations.

To illustrate briefly the importance of the inverse of a matrix, imagine that you want to solve the linear system given in its matrix form by Equation (1.8), where the matrix A is square. In other words, imagine that you want to solve the following linear system with the same number of equations than unknowns,

A x = b . (2.10)

If we could find the inverse A 1 of the coefficient matrix A , we could then multiply both sides of Equation (2.10) by A 1 and obtain that

A 1 A x = A 1 b . (2.11)

Now, noticing that A 1 A = I , Equation (2.11) becomes

I x = A 1 b (2.12)

or, equivalently,

x = A 1 b . (2.13)

Thus, if the inverse of the coefficient matrix A exists and is found, then the linear system given by Equation (2.10) can be solved and its solution can be given explicitly by Equation (2.13). This seems to offer us another statement to add to the Equivalent Statements Theorem 4. In fact, after completing the corresponding reading, you will be able to prove the next extended version of the Equivalent Statements Theorem.

Theorem 5.  Equivalent Statements

If A is an n × n matrix, then the following statements are equivalent (they are all true or all false):

  1. A is invertible.
  2. A x = 0 has only the trivial solution.
  3. The reduced row echelon form of A is I n .
  4. A can be expressed as a product of elementary matrices.
  5. A x = b is consistent for every n × 1 matrix b .
  6. A x = b has exactly one solution for every vector n × 1 matrix b .

Reading Assignment

Read, and study, pages 61–66 of the textbook (to “Exercise Set 1.6”). This textbook reading guides you through the fascinating relation between the inverse of a matrix and a system of linear equations. Pay close attention to the examples for determining consistency conditions in the textbook (“Examples 3 and 4”, pp. 65–66).

Exercises

Work on the following textbook exercises from “Exercise Set 1.6” (pp. 66–67):

  • 1, 3, 5, 7, 9, 13, 15, 18, 19, 21, 23 and
  • “True-False Exercises” (a, b, f).

2.4  Special Forms of Matrices

This section introduces three special forms of matrices that appear commonly in linear algebra applications:

  1. Diagonal matrices.
  2. Triangular matrices.
  3. Symmetric matrices.

Reading Assignment

Read, and study, pages 67–72 of the textbook (to “Exercise Set 1.7”).

These pages introduce the definitions and properties of the diagonal, triangular and symmetric matrices.

Exercises

Work on the following textbook exercises from “Exercise Set 1.7” (pp. 72–75):

  • 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 26, 27, 29, 31, 35, 41, 47; and
  • “True-False Exercises” (a, b, c, d, h, i, j, m).

2.5  Applications of Linear Algebra to Economic Systems: Leontief Economic Models

This section introduces a fascinating example of how linear algebra can be applied to the field of economics. In particular, the section discusses the Leontief Input-Output Models, linear algebra models that were worth a Nobel prize in 1973, developed by the Russian-American economist Wassily Leontief. Leontief’s analysis is based on the division of an economy in sectors, which, on the one hand, can produce “outputs” and, on the other hand, require “inputs” from other sectors. This information, organized in matrices, allows the study of the influence of different sectors in an economy on each other.

Reading Assignment

Read, and study, pages 96–100 of the textbook (to “Exercise Set 1.10”). This reading provides a comprehensible introduction to the analysis of an economy when it is divided into sectors using linear algebra. The linear algebra models arising from this analysis are called, due to their developer, Leontief’s Input-Output Models. The reading illustrates, for example, how an economy can meet its production demands, or under which conditions an economy is productive. Thus, close attention should be paid to the examples provided in the reading.

Exercises

Work on the following textbook exercises from “Exercise Set 1.10” (pp. 100–101):

  • 1, 3, 5, 7, 11; and
  • “True-False Exercises” (a, b, c, d, e).