Our first definition will be familiar to most, although our notation and terminology may be a little different.
A (real) polynomial in the indeterminate is an expression of the form
where and . The numbers are called the coefficients of the polynomial.
We shall often write , , etc., to represent polynomials.
and are both polynomials.
In a polynomial the terms may be written in any order; for instance, in Example 7.1.2 we could also write
Any term with coefficient zero can be omitted, so that is the same polynomial as . The zero polynomial is simply written ; by removing or inserting zero terms if necessary, any non-zero polynomial can be written in the form with .
If is a non-zero polynomial with , then is the degree of , written , and is its highest (or leading) term. The zero polynomial has no degree and no highest term.
The polynomial has degree and highest term
Note that polynomials of degree are just non-zero real numbers, such as or (called constants, or scalars). Polynomials of degree , such as or , are called linear polynomials (because the graph of the function is a straight line); those of degree , such as , are quadratic; those of degree , such as , are cubic.
We can now state precisely what it means for two polynomials and to be equal. Indeed, if and only if either and are both the zero polynomial, or they have the same degree, say , and when they are written in the form
we have for each
Polynomials have an arithmetic which is rather similar to that of the integers. We can always add or subtract two polynomials to obtain another, by adding or subtracting ‘‘like terms’’ (in which the power of is the same). We can also multiply any two polynomials to obtain another; this is more complicated, but just involves multiplying ‘‘term by term’’ according to the rule for , and then gathering together like terms.
Let and . Then
Note that if and are non-zero polynomials with highest terms and , respectively, then the highest term of the product is Thus the degree of the product is the sum of the degrees of and :
(7.1.1) |
This easy observation has an important consequence. We saw in the example above that when multiplying two polynomials it may sometimes happen that certain terms ‘‘cancel out’’ (as and ). However, it is not possible for all terms to do so, because the highest term cannot. Thus the product of two non-zero polynomials is non-zero; or to put it another way,
(7.1.2) |
As in the case of , we refer to this fact by saying that the polynomials have no zero divisors.
So far everything has been fairly straightforward; just as with the integers, however, it is when we come to division that things get difficult. If divides , meaning that for some , we shall write ; but our observation above on highest terms in products makes it easy to see that this does not always happen. For example, let and . If for some , then would have to be non-zero (as is), so that the degree of the right-hand side would be at least (the degree of ), while the degree of the left-hand side is — a contradiction.
Essentially the problem here is that we are trying to divide something ‘‘small’’ by something ‘‘large’’; is ‘‘too big’’ to divide . We met a similar situation when we first came across division of integers: is ‘‘too big’’ to divide , for example. In that setting we found that there are two possible ways forward. One is to accept that, in dividing by a number , we cannot divide anything smaller than ; thus we look for an answer of the form ‘‘quotient with remainder’’ (as we did in Chapter 4). The other is to enlarge our number system by introducing fractions (as we did when constructing from in Section 6.4), which then allows us to divide any number by any other, provided of course that the second is non-zero.
Each of these two approaches has its counterpart in the world of polynomials. Here we shall take the first. Thus, given two polynomials and (with ), we seek to divide by to obtain a quotient and a remainder satisfying
The aim is to do this in such a way that is ‘‘smaller’’ than ; we use the degree of a polynomial as a measure of its size, so we want the degree of to be less than that of (or ). It is not immediately obvious that we can always do this, but it is in fact possible. Briefly, the trick is to do the division step by step, reducing the degree of the polynomial being divided at each stage. We call this method of division with remainder the division algorithm for polynomials. Before we formally state and prove this theorem, let us consider an example.
Use the division algorithm to divide by .
Solution.
thus
The following lemma formalizes the ‘‘reduction-of-degree’’ approach that the division algorithm relies on.
Let and be non-zero polynomials with say
where and . Define . Then either or
Proof. We have
so that
This shows that either (if all the coefficients on the right-hand side vanish), or this polynomial has degree at most
(Polynomial division with remainder.) Let and be polynomials, and suppose that . Then there are unique polynomials and such that and either or .
Terminology: the polynomial is called the quotient, while is the remainder.
Proof. We begin with the existence part. In the special case where divides , we have for some polynomial , and we may simply take . Note: this case covers (because all polynomials divide ), and also the case where has degree (because if for some , then we have ). Hence we may suppose that , so that is defined, and that . In this case we establish the existence of and by induction on , using the Generalized (or Second) Principle of Induction, where the inductive step consists of showing that if the result holds for each (not just for ), then it also holds for .
The base step () is easy. Since , and have the required properties.
Now let and assume inductively that, for each non-zero polynomial of degree less than , there are polynomials and such that
(7.1.3) |
If , then we can take and . Otherwise we choose as in Lemma 7.1.9 and define , so that either or . In the first case, we can take and . In the second case, the induction hypothesis implies that there are polynomials and satisfying (7.1.3), and we have
Hence the identity holds if we define and , and the second half of (7.1.3) ensures that this choice of has the required properties. Thus the induction continues. Note that this inductive argument is essentially just a formal version of the division algorithm that we used in Example 7.1.8.
To prove that the polynomials and are unique, suppose that , , and are polynomials such that and either we have or for . Our aim is to show that and . Rearranging the defining equations, we see that
which implies that
(7.1.4) |
Assume towards a contradiction that , so that the left-hand side of (7.1) is non-zero. On the one hand, this implies that
by (7.1.1). On the other hand, because and are either or of smaller degree than . This is clearly impossible, so we must have . Combining this with (7.1), we obtain . Since , we deduce that by (7.1.2).
We conclude this section with two remarks about polynomials in general.
One can consider polynomials over number systems other than , such as (called ‘‘integral polynomials’’), (‘‘rational polynomials’’) or (‘‘complex polynomials’’), or even (called ‘‘polynomials over ’’) for .
We do not think of polynomials as functions (that is, ‘‘machines’’ that map one real number to another). This is why we refer to as an ‘‘indeterminate’’, not a ‘‘variable’’. To emphasize this difference, we use a capital rather than a small .
To exemplify the difference between polynomials and functions, consider the polynomials and over . They are, by definition, distinct polynomials; however, we have
so if we regarded and as functions from to , they would actually be equal!