In the next chapter we will formulate the general principles of quantum mechanics. Here, we introduce the necessary mathematical framework, which concerns vectors, scalar products (a complex version of the dot product), and operators. We describe these entities in the powerful Dirac notation, which is widely used throughout quantum mechanics.
The most common form of a vector is a collection of components $\psi_n$, $n = 1, 2, \ldots, N$, where $N$ denotes the dimension of the vector space. A function $\psi(x)$ can be considered as a vector in which the discrete index $n$ has been replaced by a continuous index $x$. It is useful to introduce a formalism in which this analogy can be exploited without direct reference to the specific forms of the components $\psi_n$ or $\psi(x)$.
In Dirac notation, vectors are denoted as $|\psi\rangle$. These vectors form a complex linear vector space, which entails the following properties: Any vector $|\psi\rangle$ can be scaled by any complex number $\alpha$, i.e., we can form new vectors $\alpha|\psi\rangle$. Furthermore, any two vectors $|\psi\rangle$, $|\phi\rangle$ can be combined into new vectors by forming a superposition $\alpha|\psi\rangle + \beta|\phi\rangle$. These operations obey the distributive law $\alpha(|\psi\rangle + |\phi\rangle) = \alpha|\psi\rangle + \alpha|\phi\rangle$. In addition, a vector space possesses a null vector $0$ such that $|\psi\rangle + 0 = |\psi\rangle$, and to each vector $|\psi\rangle$ there is an inverse vector $-|\psi\rangle$ such that $|\psi\rangle + (-|\psi\rangle) = 0$.
These properties are all nicely fulfilled for functions. In particular, if $\psi(x)$ and $\phi(x)$ are functions and $\alpha$, $\beta$ are constants, then $\alpha\psi(x) + \beta\phi(x)$ is also a function.
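As a purely illustrative check of these axioms, one can model kets as complex arrays; the following Python sketch assumes this representation (an assumption of the example, not part of the formalism), with arbitrary vectors and scalars.

```python
import numpy as np

# Illustrative sketch: kets modelled as complex NumPy arrays.
psi = np.array([1.0, 2.0j, -1.0])          # a vector with N = 3 components
phi = np.array([0.5, 1.0, 1.0 + 1.0j])
alpha, beta = 2.0j, -1.0                   # arbitrary complex scalars

chi = alpha * psi + beta * phi             # a superposition is again a vector
null = psi + (-psi)                        # the inverse vector yields the null vector
print(chi, null)
```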
The scalar product $\langle\phi|\psi\rangle$ is a generalised version of the dot product, which associates a complex number with any pair of vectors $|\psi\rangle$, $|\phi\rangle$. The scalar product fulfils the important property

$$\langle\phi|\psi\rangle = \langle\psi|\phi\rangle^* . \tag{116}$$
Consistently with this, the scalar product is linear in the second argument, but conjugate linear in the first argument, i.e., $\langle\phi|\alpha\psi\rangle = \alpha\langle\phi|\psi\rangle$, $\langle\phi|\psi_1+\psi_2\rangle = \langle\phi|\psi_1\rangle + \langle\phi|\psi_2\rangle$, $\langle\alpha\phi|\psi\rangle = \alpha^*\langle\phi|\psi\rangle$, $\langle\phi_1+\phi_2|\psi\rangle = \langle\phi_1|\psi\rangle + \langle\phi_2|\psi\rangle$.
In general, a scalar product must be positive definite, $\langle\psi|\psi\rangle > 0$ for $|\psi\rangle \neq 0$. We call $\|\psi\| = \sqrt{\langle\psi|\psi\rangle}$ the norm of the vector (this generalises the notion of length of an ordinary vector). A vector with $\|\psi\| = 1$ is called normalised (this generalises the notion of a unit vector). The procedure of passing from a vector $|\psi\rangle$ to the normalised vector $|\psi\rangle/\|\psi\|$ is called normalisation. Again in analogy to the case of ordinary vectors, two vectors $|\psi\rangle$, $|\phi\rangle$ fulfilling $\langle\phi|\psi\rangle = 0$ are said to be orthogonal to each other.
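These definitions are easy to verify numerically. In the sketch below (illustrative vectors, Python/NumPy assumed), `np.vdot` conjugates its first argument, which matches the convention that the scalar product is conjugate linear in the first entry.

```python
import numpy as np

psi = np.array([1.0, 1.0j])
phi = np.array([1.0, -1.0j])

# <phi|psi>: np.vdot conjugates its first argument.
print(np.vdot(phi, psi))                                   # 0 here: psi and phi are orthogonal
print(np.vdot(phi, psi) == np.vdot(psi, phi).conjugate())  # Eq. (116)

norm = np.sqrt(np.vdot(psi, psi).real)                     # ||psi||
psi_normalised = psi / norm                                # normalisation
print(np.vdot(psi_normalised, psi_normalised))             # -> 1
```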
In conjunction with a certain completeness condition which is always fulfilled in quantum mechanics, a vector space equipped with a scalar product is called a Hilbert space.
Formally, the scalar product can be interpreted as a product between the vectors $|\psi\rangle$ and the entities $\langle\phi|$, which form the dual vector space. They represent the left entries in the scalar product and therefore are also conjugate linear: the dual vector of $\alpha|\phi\rangle$ is $\alpha^*\langle\phi|$. A dual vector $\langle\phi|$ is also called a bra, and an ordinary vector $|\psi\rangle$ is called a ket, alluding to the fact that in the scalar product $\langle\phi|\psi\rangle$ they form a bracket (bra-ket). The introduction of these dual vectors is an important step in the Dirac notation; its usefulness will become clear when we discuss operators (generalised matrices).
A basis is a collection of vectors $|\phi_n\rangle$ such that any vector can be written as a superposition $|\psi\rangle = \sum_n \psi_n |\phi_n\rangle$, where the complex coefficients $\psi_n$ are unique. The coefficients give a representation of the vector, and can be written as a column vector

$$\begin{pmatrix} \psi_1 \\ \psi_2 \\ \vdots \\ \psi_N \end{pmatrix} . \tag{117}$$
The corresponding dual vector is written as a row vector $(\psi_1^*, \psi_2^*, \ldots, \psi_N^*)$. While there are many possible bases, in which the same vector is represented by different coefficients, the number of basis vectors required to obtain all vectors is always the same, and is called the dimension $N$ of the vector space ($N$ may be $\infty$).
An orthogonal basis fulfils $\langle\phi_n|\phi_m\rangle = 0$ for any $n \neq m$. If furthermore $\langle\phi_n|\phi_n\rangle = 1$ for all $n$, one speaks of an orthonormal basis. In such a basis, the coefficients representing a vector are given by $\psi_n = \langle\phi_n|\psi\rangle$, and the scalar product takes the explicit form

$$\langle\phi|\psi\rangle = \sum_n \phi_n^* \psi_n . \tag{118}$$
Thus, a vector is normalised if its coefficients in an orthonormal basis obey

$$\sum_n |\psi_n|^2 = 1 . \tag{119}$$
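As an illustration of Eqs. (118) and (119), the following sketch expands a normalised vector in an orthonormal basis of $\mathbb{C}^2$; the basis and vector are arbitrary choices made for this example.

```python
import numpy as np

# An orthonormal basis of C^2 (illustrative choice).
phi1 = np.array([1.0, 1.0]) / np.sqrt(2)
phi2 = np.array([1.0, -1.0]) / np.sqrt(2)

psi = np.array([0.6, 0.8j])                      # a normalised vector

# Coefficients psi_n = <phi_n|psi>.
c1, c2 = np.vdot(phi1, psi), np.vdot(phi2, psi)

print(np.allclose(c1 * phi1 + c2 * phi2, psi))   # the expansion reconstructs |psi>
print(abs(c1)**2 + abs(c2)**2)                   # Eq. (119): sums to 1
```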
For functions $\psi(x)$, the summation over the discrete index $n$ is replaced by an integration, $\sum_n \rightarrow \int dx$. In this case the dimension of the vector space is infinite. The orthonormality of a basis can be stated with help of the Dirac delta function, $\langle x|x'\rangle = \delta(x - x')$. In the scalar product, the summation over the discrete index is again replaced by an integration over the continuous index $x$,

$$\langle\phi|\psi\rangle = \int \phi^*(x)\, \psi(x)\, dx . \tag{120}$$
This type of integral is called an overlap integral. The expression for the expansion coefficients takes the form $\psi(x) = \langle x|\psi\rangle$, and the normalisation condition translates into $\int |\psi(x)|^2\, dx = 1$.
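Numerically, such overlap integrals can be approximated by sampling the functions on a grid; the sketch below (Gaussian test functions, a simple Riemann sum) is only meant to illustrate Eq. (120) and the normalisation condition.

```python
import numpy as np

# Functions sampled on a grid; the integral becomes a sum weighted by dx.
x, dx = np.linspace(-10.0, 10.0, 2001, retstep=True)

psi = np.pi**-0.25 * np.exp(-x**2 / 2)           # normalised Gaussian
phi = np.pi**-0.25 * np.exp(-(x - 1.0)**2 / 2)   # shifted Gaussian

overlap = np.sum(phi.conj() * psi) * dx          # <phi|psi>, Eq. (120)
norm = np.sum(np.abs(psi)**2) * dx               # normalisation integral -> 1
print(overlap, norm)
```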
An operator $\hat{A}$ converts any vector $|\psi\rangle$ into another vector $\hat{A}|\psi\rangle$. Linear operators fulfil $\hat{A}(\alpha|\psi\rangle + \beta|\phi\rangle) = \alpha\hat{A}|\psi\rangle + \beta\hat{A}|\phi\rangle$, where $\alpha$, $\beta$ are complex numbers. Operators can be added according to the rule $(\hat{A} + \hat{B})|\psi\rangle = \hat{A}|\psi\rangle + \hat{B}|\psi\rangle$, and multiplied according to the rule $(\hat{A}\hat{B})|\psi\rangle = \hat{A}(\hat{B}|\psi\rangle)$.
In Dirac notation, operators are written as $\hat{A} = \sum_{nm} A_{nm} |\phi_n\rangle\langle\phi_m|$, and the action of an operator is obtained from the multiplication rule $(|\phi_n\rangle\langle\phi_m|)\,|\psi\rangle = |\phi_n\rangle\,\langle\phi_m|\psi\rangle$. Thus,

$$\hat{A}|\psi\rangle = \sum_{nm} A_{nm}\, |\phi_n\rangle\, \langle\phi_m|\psi\rangle . \tag{121}$$
Assuming that the states $|\phi_n\rangle$ in the definition of $\hat{A}$ form an orthonormal basis, the operator can be represented by an $N$-dimensional square matrix

$$A = \begin{pmatrix} A_{11} & A_{12} & \cdots \\ A_{21} & A_{22} & \cdots \\ \vdots & \vdots & \ddots \end{pmatrix} , \tag{122}$$

where the matrix elements are obtained from $A_{nm} = \langle\phi_n|\hat{A}|\phi_m\rangle$. Since in an orthonormal basis $\psi_m = \langle\phi_m|\psi\rangle$, the operator then acts on a vector according to the standard rules of matrix multiplication, i.e., $\hat{A}|\psi\rangle$ is represented by a vector with coefficients $\sum_m A_{nm}\psi_m$. Furthermore, the operator addition and multiplication rules then translate to the usual prescriptions of matrix addition and multiplication.
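This correspondence between operators and matrices can be checked directly. In the sketch below, an arbitrary linear map on $\mathbb{C}^2$ (chosen for illustration) is first tabulated via $A_{nm} = \langle\phi_n|\hat{A}|\phi_m\rangle$ and then applied by ordinary matrix multiplication.

```python
import numpy as np

# Orthonormal basis of C^2 (here the standard basis).
basis = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]

def A_action(v):
    """An illustrative linear operator on C^2."""
    return np.array([2.0 * v[0] + 1.0j * v[1], -1.0j * v[0] + 3.0 * v[1]])

# Matrix elements A_nm = <phi_n| A |phi_m>, Eq. (122).
A = np.array([[np.vdot(bn, A_action(bm)) for bm in basis] for bn in basis])

psi = np.array([1.0, 1.0j])
# Acting with the operator = matrix multiplication on the coefficients.
print(np.allclose(A @ psi, A_action(psi)))        # True
```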
The action of an operator is particularly simple in its eigenrepresentation, defined by a basis fulfilling the eigenvalue equation $\hat{A}|\phi_n\rangle = a_n|\phi_n\rangle$. The numbers $a_n$ are called eigenvalues, and the associated vectors $|\phi_n\rangle$ are called eigenvectors. When appropriate, these eigenvectors are also called eigenfunctions.
If the eigenvectors form an orthonormal basis (as is the case for the hermitian and unitary operators considered below), the eigenrepresentation results in a diagonal matrix, with $A_{nm} = 0$ if $n \neq m$ and $A_{nn} = a_n$. In Dirac notation, the operator can then be written as $\hat{A} = \sum_n a_n |\phi_n\rangle\langle\phi_n|$.
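In matrix language: diagonalising a matrix and summing $a_n |\phi_n\rangle\langle\phi_n|$ over the eigenpairs reconstructs the operator. A short sketch with an arbitrary hermitian matrix (hermiticity is discussed below):

```python
import numpy as np

A = np.array([[2.0, 1.0j], [-1.0j, 2.0]])     # an illustrative hermitian matrix

a, V = np.linalg.eigh(A)                      # eigenvalues a_n, eigenvectors in columns

# Eigenrepresentation: A = sum_n a_n |phi_n><phi_n|.
A_rebuilt = sum(a[n] * np.outer(V[:, n], V[:, n].conj()) for n in range(len(a)))
print(np.allclose(A_rebuilt, A))              # True
```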
A particularly simple operator is the identity operator $\hat{1}$ which leaves all states unchanged, $\hat{1}|\psi\rangle = |\psi\rangle$. Every state is therefore an eigenstate of $\hat{1}$, with eigenvalue 1. Consequently, in any orthonormal basis this operator takes the same form $\hat{1} = \sum_n |\phi_n\rangle\langle\phi_n|$. Representations are simply obtained by multiplying out the identities $|\psi\rangle = \hat{1}|\psi\rangle$ and $\hat{A} = \hat{1}\hat{A}\hat{1}$. In a given orthonormal basis, it is useful to decompose the identity as the sum of projection operators $\hat{P}_n = |\phi_n\rangle\langle\phi_n|$, which fulfil $\hat{P}_n\hat{P}_m = 0$ if $n \neq m$, and $\hat{P}_n^2 = \hat{P}_n$.
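The stated properties of the projectors are again easy to verify for a concrete orthonormal basis (the basis below is an arbitrary illustrative choice):

```python
import numpy as np

# An orthonormal basis of C^2.
phi1 = np.array([1.0, 1.0j]) / np.sqrt(2)
phi2 = np.array([1.0, -1.0j]) / np.sqrt(2)

P1 = np.outer(phi1, phi1.conj())              # P_n = |phi_n><phi_n|
P2 = np.outer(phi2, phi2.conj())

print(np.allclose(P1 @ P1, P1))               # P_n^2 = P_n
print(np.allclose(P1 @ P2, np.zeros((2, 2)))) # P_n P_m = 0 for n != m
print(np.allclose(P1 + P2, np.eye(2)))        # the projectors sum to the identity
```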
For each operator $\hat{A}$ we can define an adjoint operator $\hat{A}^\dagger$ by setting $\langle\phi|\hat{A}^\dagger|\psi\rangle = \langle\psi|\hat{A}|\phi\rangle^*$. In an orthonormal basis we then have $(A^\dagger)_{nm} = A_{mn}^*$. For many operators, we can also define an inverse operator $\hat{A}^{-1}$ which fulfils $\hat{A}^{-1}\hat{A} = \hat{A}\hat{A}^{-1} = \hat{1}$.
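In matrix form the adjoint is simply the conjugate transpose, which makes its defining property straightforward to check (matrix and vectors below are illustrative):

```python
import numpy as np

A = np.array([[1.0, 2.0j], [0.0, 3.0]])
A_dag = A.conj().T                            # adjoint = conjugate transpose

phi = np.array([1.0, 1.0j])
psi = np.array([2.0, -1.0])

# Defining property: <phi|A†|psi> = <psi|A|phi>*.
print(np.isclose(np.vdot(phi, A_dag @ psi), np.vdot(psi, A @ phi).conj()))
```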
Two important types of operators are hermitian operators $\hat{H}$ and unitary operators $\hat{U}$. For any two states $|\psi\rangle$, $|\phi\rangle$, hermitian operators fulfil $\langle\phi|\hat{H}|\psi\rangle = \langle\psi|\hat{H}|\phi\rangle^*$, while unitary operators fulfil $\langle\hat{U}\phi|\hat{U}\psi\rangle = \langle\phi|\psi\rangle$. This entails $\hat{H}^\dagger = \hat{H}$ and $\hat{U}^\dagger = \hat{U}^{-1}$. In an orthonormal basis, the matrix elements of a hermitian operator fulfil $H_{nm} = H_{mn}^*$, while those of a unitary operator fulfil $\sum_l U_{ln}^* U_{lm} = \delta_{nm}$.
Both classes of operators have the nice property that their sets of normalised eigenvectors form an orthonormal basis. For hermitian operators, the eigenvalues $h_n$ are real, while for unitary operators they fulfil $|u_n| = 1$.
Unitary operators are analogous to orthogonal matrices which rotate a coordinate system. In particular, any basis change from one orthonormal basis $|\phi_n\rangle$ to another orthonormal basis $|\phi_n'\rangle$ can be written as $|\phi_n'\rangle = \hat{U}|\phi_n\rangle$, where $\hat{U}$ is a suitable unitary operator. A common form of unitary operators relates them to a hermitian operator $\hat{H}$ via $\hat{U} = \exp(i\alpha\hat{H})$, where $\alpha$ is a real constant and the exponential of an operator is defined via its Taylor expansion, $\exp(\hat{A}) = \sum_{k=0}^\infty \hat{A}^k/k!$. In this case, $\hat{U}^\dagger = \exp(-i\alpha\hat{H}) = \hat{U}^{-1}$. Furthermore, the operators $\hat{U}$ and $\hat{H}$ then share the same eigenvectors: If $\hat{H}|\phi_n\rangle = h_n|\phi_n\rangle$, then $\hat{U}|\phi_n\rangle = e^{i\alpha h_n}|\phi_n\rangle$, with eigenvalues $e^{i\alpha h_n}$.
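The relation $\hat{U} = \exp(i\alpha\hat{H})$ can be checked numerically with a matrix exponential; the sketch below uses `scipy.linalg.expm` and an arbitrary hermitian matrix chosen for illustration.

```python
import numpy as np
from scipy.linalg import expm

H = np.array([[1.0, 1.0 - 1.0j], [1.0 + 1.0j, -1.0]])   # hermitian (illustrative)
alpha = 0.7
U = expm(1j * alpha * H)                                 # U = exp(i*alpha*H)

print(np.allclose(U.conj().T @ U, np.eye(2)))            # U†U = 1: U is unitary

h, V = np.linalg.eigh(H)                                 # eigenpairs of H
for n in range(2):
    # U shares H's eigenvectors, with eigenvalues exp(i*alpha*h_n).
    print(np.allclose(U @ V[:, n], np.exp(1j * alpha * h[n]) * V[:, n]))
```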