The material of this section provides some of the most useful mathematical tools available for scientists, engineers, social scientists, and others. We will see how to solve systems of linear equations (see Definition 5.1.2) by using the elementary row operations introduced in Section 2. This technique was used by Chinese mathematicians over two thousand years ago. We will see how to translate a system of linear equations into an augmented matrix, and then row reduce that matrix to solve the equations.
The purpose of the next two examples is to demonstrate what is meant by “solving systems of linear equations”, and to convince you that this is a useful and natural thing to do. These examples show how linear equations may arise in real-life contexts, and we will use them in order to understand a general system of linear equations.
Example 5.1.1.
Let be the total cost, in pounds, of producing a certain item and let be the number of items produced. Suppose that manufacturer A has a start-up cost of £ and a cost per item of £, which gives the equation
(2) |
Suppose also that manufacturer B has a start-up price of £ and each item costs £, which yields the equation
(3) |
Which manufacturer should be used?
The equations given by this example are linear, i.e. each variable, here and , appears with exponent at most one. Also, note that we use the fraction notation instead of the decimal notation in the algebraic equation.
Thus, to answer this question, we start by plotting the two manufacturers’ costs:
From this diagram, it is clear that the decision depends on the number of items required. Namely, manufacturer A, corresponding to (2), has a lower cost than manufacturer B, as long as the number of items required is less than the -coordinate corresponding to the intersection of the two lines in the graph. This intersection point translates into the fact that both manufacturers’ costs are equal, and we see that to its right, then manufacturer B is cheaper. Thus, we need to find the coordinates of the intersection point. It is given by the values of and which simultaneously satisfy Equations (2) and (3).
To find these values, we rearrange the equations in an array, i.e. a matrix, having two rows, say and .
We now perform elementary row operations on this “matrix” in order to put it in reduced echelon form. Hence, we first do and get new equations
So we deduce that . In other words, we perform the row operation .
To solve for , we substitute into ; in other words we perform :
This is the solution to the system of linear equations. In the terminology of Section 2, we have found the reduced row echelon form of the linear system of equations.
The answer to the original question is that as long as we need no more than items, then manufacturer A is cheaper than manufacturer B. If we need exactly items, then both manufacturers will be equally expensive, namely £. Finally, if we need more than items, manufacturer B is cheaper.
A system of linear equations in variables , is a collection of equations in which the variables each occur separately with exponent at most 1.
To solve such a system, means to find all possible values of which satisfy all of the equations simultaneously. A set of such values is also called a solution to the system of equations.
If no solution exists, then the system is said to be inconsistent.
Example 5.1.3.
The following equations are linear in the variables and :
The following equations are not linear in the variables and :
The coefficients of and in the given system of linear equations determine the number of solutions. For instance, let us see see what happens when we slightly modify Example 5.1 by taking the equations
In this case, the graphs are parallel lines, saying that manufacturer A is always cheaper than B.
The geometrical fact that the lines do not intersect corresponds to the algebraic fact that the system of equations has no solution. That is, there are no values of and which simultaneously satisfy both equalities, so the system is inconsistent. How can we recognise this situation when there are more than two equations? In the next example we have three equations.
Example 5.1.4.
Assume an animal’s diet is based on three different types of food A,B and C, where a unit of each one contains the following amount of protein, carbohydrate and fat:
protein | carbohydrate | fat | |
A | 5 | 3 | 4 |
B | 2 | 2 | 1 |
C | 3 | 1 | 4 |
We require exactly units of protein, units of carbohydrate and units of fat. How many units of A,B and C do we need?
First we translate the word problem into mathematical expressions. The word “exactly” indicates an equality. For instance, counting the amount of protein, we have:
{# units of A} + { # units of B} + { # units of C } units.
In order to further translate the problem, let us give names to our variables, say and , for the numbers of units of A,B and C respectively. The above conditions give rise to the following system of linear equations in and .
If we try to proceed as in the first example, we soon run into difficulties, since the equations now represent planes in the three dimensional space. A graphical solution becomes harder to handle by hand. To solve the problem we proceed algebraically by solving the three equations simultaneously for and . Starting from the above array, one way of proceeding is to eliminate variables by using row operations. We eliminate from and from by doing first and then . The resulting array is:
Hence, we do , and the new becomes the equation .
From it, we find . Now, let us substitute the solution for into any of the above equations, in order to determine the values of and . So
The row operation isn’t an elementary row operation, but it is the combination of one elementary row operation followed by another. So a non-elementary row operation is a sequence of elementary row operations. This shortened notation is sometimes preferable to writing out two separate elementary operations.