Home page for accesible maths 6 Linear transformations

Style control - access keys in brackets

Font (2 3) - + Letter spacing (4 5) - + Word spacing (6 7) - + Line spacing (8 9) - +

6.1. Definition and first examples

Definition 6.1.1.

We will use the word transformation to refer to any function T from n to m. We will write

T:nm.

So if vn then T(v)m. In this case we call T(v) the image of v.

Definition 6.1.2.

A transformation T:nm is called a linear transformation if T satisfies the two axioms

  • LT1

    T(v+w)=T(v)+T(w) for all v,wn, and

  • LT2

    T(λv)=λT(v) for all λ and all vn.

Recall Definition 1.2.1, which says that for λ, the vector λv is obtained by multiplying each coordinate by λ.

Example 6.1.3.

  • The following transformations T: are linear transformations:

    T(x)=2x,T(x)=x,T(x)=0.

    The following transformations T: are not linear:

    T(x)=x+1,T(x)=-2,T(x)=sinx,T(x)=x2.
Remark 6.1.4.

If T is linear, then T(0+0)=T(0)+T(0). Therefore, T(0)=0.

Example 6.1.5.

  • Let AM2(), and consider the transformation T:22 given by T(v)=Av for each v2. Since v is a column vector, the matrix product Av is also a column vector. T defined in this way is a linear transformation. This follows from Lemma 1.4.10. In particular, LT1 is satisfied, because A(v+w)=Av+Aw, and LT2 is satisfied because A(λv)=λ(Av) for any λ. So T is called the linear transformation associated to the matrix A.

The fundamental result of this section is that all linear transformations come from matrices, in the sense of Example 6.1. Before we make this statement more precise, let’s introduce some of the standard transformations. We will focus on three main types of transformations of 2: translation, rotation, and reflection. Translation is not a linear transformation (by 6.1.4), but rotation and reflection are.

Definition 6.1.6.

Translation by a vector. This is an example of a transformation which is not linear. Given a vector v=(ab)2, the translation by v is the transformation given by adding v to every vector z2 (See Figure 1). In other words, T(z):=z+v. It may be written in coordinates as follows:

Tv(x,y)=(x+a,y+b)for all(x,y)2.
Figure 1. The vector z+v is the translation of z by v.
Definition 6.1.7.

Rotation by an angle. Given an angle θ[0,2π) (in radians), the rotation through θ is the anticlockwise rotation Rθ through θ about the origin (see Figure 2).

Figure 2. The point P is mapped to the point P by the rotation by angle θ around the origin (0,0). The vector corresponding to P makes an angle α with the x-axis.

Rotation around the origin is the linear transformation associated to the following matrix (in the sense of Example 6.1):

Rθ=(cosθ-sinθsinθcosθ).

For the proof of this fact, see Proposition 6.2.8.

Example 6.1.8.

    1. (a)

      When θ=0, then nothing is rotated, so the transformation should be the identity transformation. Indeed:

      R0(xy)=(cos0-sin0sin0cos0)(xy)=(x+0y0x+y)=(xy)for all(xy)2.

      In other words, R0=I2.

    2. (b)

      When θ=π, rotation by the angle π should be the same as negating every vector. Indeed:

      Rπ(xy)=(cosπ-sinπsinπcosπ)(xy)=(-x-y)for all(xy)2.
Definition 6.1.9.

Reflection about a line. Given a line l through the origin which makes an angle θ above the positive x-axis, the reflection about l is the transformation Hθ mapping any point (x,y)2 to its reflection, assuming the line l is a mirror.

Figure 3. The point P is the image of P by the reflection about the line l. The line l makes the angle θ with the x-axis.

Reflection about a line through the origin is the linear transformation associated to the following matrix (in the sense of Example 6.1):

Hθ=(cos2θsin2θsin2θ-cos2θ).

For the proof of this fact, see Proposition 6.2.9.

Example 6.1.10.

    1. (a)

      When θ=0, the reflecting line is the x-axis, so we have

      H0(xy)=(cos0sin0sin0-cos0)(xy)=(x+0y0x-y)=(x-y)for all(xy)2.

      This makes sense, because a reflection in the x-axis should simply multiply the y-coordinate by -1.

    2. (b)

      When θ=π2, the reflecting line is the y-axis, and we have

      Hπ2(xy)=(cosπsinπsinπ-cosπ)(xy)=(-xy)for all(xy)2;

      again, this is as one would expect.