Home page for accesible maths 1 Fields and Matrices

Style control - access keys in brackets

Font (2 3) - + Letter spacing (4 5) - + Word spacing (6 7) - + Line spacing (8 9) - +

1.B Row operations and reduced echelon form

At this point, an inquisitive student (which I hope you are) should be asking various questions, such as: “Why should we care about fields? What is the purpose of these axioms”? To answer these questions, let’s recall some facts from MATH105.

Definition 1.3:

Firstly, choose a field F, which is usually either F= or F=. Then, let AMn×m(F) be matrix with n rows and m columns, whose coefficients are in the set F. Then an elementary row operation (e.r.o.) is defined to be one of the following three operations:

  1. 1

    Ri=ri+λrj For λF, add λ times row j to row i.

  2. 2

    Ri=λri For 0λF, multiply row i by λ.

  3. 3

    RiRj Swap rows i and j.

We also define elementary column operation (e.c.o.) to be the same as above, except replacing the rows (Ri and ri) with columns (Ci and ci). For example, multiplying a column by a non-zero scalar is an e.c.o., but not an e.r.o.

Theorem 1.4.

For F=R, any e.r.o.  moves matrices from Mn×m(R) to Mn×m(R).

Considering the second e.r.o., this theorem says that if you take any matrix with real coefficients, and multiply a row by a real number, then you end up with another matrix with real coefficients. That’s pretty obvious. Now consider the next two theorems, which are only slightly less obvious.

Theorem 1.5.

For F=C, any e.r.o.  moves matrices from Mn×m(C) to Mn×m(C).

Theorem 1.6.

For F=Q, any e.r.o.  moves matrices from Mn×m(Q) to Mn×m(Q).

These are three distinct theorems, and you are expected to know all three of them. But they somehow seem “the same”. Rather than proving each one separately, we will prove the following generalization. Since ,, and are all fields, we are proving all three of the above theorems simultaneously. In the proof we are only allowed to use the field axioms.

Theorem 1.7.

For any field F, any e.r.o.  moves matrices from Mn×m(F) to Mn×m(F).

Proof.

Firstly, it is clear that the size of the matrix does not change. So we only need to check that the coefficients of the new matrix are still in F.

Consider the first row operation Ri=ri+λrj, where λF. It changes the coefficients of the ith row from aik to aik+λajk. By F2, λajkF, and therefore by F1 we know aik+λajkF. This is true for every k=1,2,,m, and so the new row has coefficients all in F.

Next, consider the second row operation Ri=λri, where 0λF. Again, by F2, multiplication is a binary operation, so the new coefficients λaik are all still in F.

Finally, for the third row operation, the coefficients after the swapping operation are still all in F. ∎

Exercise 1.8:

For F=, give an example of a matrix in M3(), and an e.r.o. (over ) which moves your matrix to a matrix not in M3().

[End of Exercise]

We will say that the matrix B is row-equivalent to the matrix A if B can be obtained from A by performing a finite sequence of elementary row operations on A (in fact, this forms an equivalence relation; see Exercise 1.19). The following definition and theorem should also be familiar from MATH105.

Definition 1.9:

A matrix is in reduced row echelon form if the following conditions are satisfied:

  1. i.

    The leading coefficient in each non-zero row is 1

  2. ii.

    Each leading coefficient is the only non-zero entry in its column

  3. iii.

    All the zero rows are in the bottom rows, and as the row numbers increase, the column numbers of the leading coefficients also (strictly) increase; i.e. the matrix is in echelon form.

Theorem 1.10.

Let F be a field. Then any matrix with coefficients in F can put into reduced echelon form by a sequence of e.r.o.’s. Furthermore, the reduced echelon form of any matrix is unique; in other words, it is independent of the sequence of e.r.o.’s.

The above theorem should be familiar when the field is F=. We will not give the proof; the general case of the proof is the same as the real case, because the only properties of the real numbers that were used are contained in the list of field axioms F1 to F11.

Exercise 1.11:

Are the matrices [120001000] and [100010000]M3() row-equivalent to each other? Justify your answer by applying Theorem 1.10.

[End of Exercise]