Appendix C - Vectors and Linear Algebra

1 Introduction
2 Vectors
3 Vector Operations
4 Matrices
5 More Dirac Notation
6 Transformations
- 6.1 Transformations of a Qubit
- 6.2 Similarity Transformation
7 Polar Decomposition and Singular Value Decomposition
8 Eigenvalues and Eigenvectors
9 Tensor Products
- 9.1 Properties of Tensor Products
10 Exercises

Introduction

This appendix introduces some aspects of linear algebra and complex algebra that will be helpful for the course. In addition, Dirac notation is introduced and explained.

Vectors

Here we introduce vectors and the notation that we use for vectors. We then give some facts about real vectors before discussing the complex vectors used in quantum mechanics.

Vectors Defining and Representing

You may have heard of the definition of a vector as a quantity with both magnitude and direction. While this is true and often used in science classes, our purpose is different. So we will simply define a vector as an array of numbers that is written in a single row or a single column. When the vector is written as a row of numbers, it is called a row vector and when it is written in a column of numbers, it is called a column vector. As we will see, these two can have the same set of numbers, but each can be used in a slightly different way.

Examples of Vectors

This is an example of a row vector ${\vec {v}}=(2,4,3).$

This is an example of a column vector ${\vec {w}}=\left({\begin{array}{c}1\\5\\4\end{array}}\right).$

Real Vectors

If you are familiar with vectors, the simple definition of a vector --- an object that has magnitude and direction --- is helpful to keep in mind even when dealing with complex vectors (vectors with complex, i.e., imaginary numbers as entries) as we will here. However, this is not necessary and we will see how to perform all of the operations that we need just using arrays of numbers that we call vectors. In three dimensional space, a vector is often written as

{\vec {v}}=v_{x}{\hat {x}}+v_{y}{\hat {y}}+v_{z}{\hat {z}},

where the hat ( ${\hat {\cdot }}\,\!$ ) denotes a unit vector and the components $v_{i}\,\!$ , $i=x,y,z\,\!$ are just numbers. The unit vectors are also known as basis vectors. This is because any vector in real three-dimensional space can be written in terms of these unit/basis vectors. In this vector, one can associate a point where the coordinate of the point is $(x,y,z)$ . That is, a point a distance $x$ from the origin along the $x$ -axis, a distance $y$ from the origin along the $y$ -axis, and a distance $z$ from the origin along the $z$ -axis. In some sense, unit vectors are the basic components of any vector. Other basis vectors could be used. But this will be discussed elsewhere.

The Magnitude or Length of a Vector

Vectors can be associated with a point in a space. The vector can then be represented as a line from the origin to the point. Each point can be associated with exactly one vector. The distance from the origin to the point is called the magnitude or length of the vector. For example, if we take the vector ${\vec {v}}=(2,4,3),$ the magnitude, or length, represented by $|{\vec {v}}|.$ can be calculated as

|{\vec {v}}|={\sqrt {2^{2}+4^{2}+3^{2}}}={\sqrt {29}}.

Vector Operations

To illustrate vector operations, let

{\vec {v}}=\left({\begin{array}{c}v_{1}\\v_{2}\end{array}}\right),{\mbox{  and  }}{\vec {w}}=\left({\begin{array}{c}w_{1}\\w_{2}\end{array}}\right).

Vector Addition

Vectors can be added. To do this, each element of one vector is added to the corresponding element of the other vector. In general, for a row vector, they add as

{\vec {v}}+{\vec {w}}=\left({\begin{array}{c}v_{1}\\v_{2}\end{array}}\right)+\left({\begin{array}{c}w_{1}\\w_{2}\end{array}}\right)=\left({\begin{array}{c}v_{1}+w_{1}\\v_{2}+w_{2}\end{array}}\right).

The addition of row vectors is similar. They are added component by component. Suppose we had

{\vec {s}}=\left(s_{1},s_{2}\right),{\mbox{  and  }}{\vec {u}}=\left(u_{1},u_{2}\right).

Then

{\vec {s}}+{\vec {u}}=\left(s_{1},s_{2}\right)+\left(u_{1},u_{2}\right)=\left(s_{1}+u_{1},s_{2}+u_{2}\right).

Since the way vectors are added is component to corresponding component, it doesn't make any sense to add a column vector and a row vector. It also doesn't make any sense to add a vector with two components to one with three components.

Example

Adding two vectors ${\vec {v}}_{1}\,\!$ and ${\vec {w}}_{1}\,\!$ with

{\vec {v}}_{1}=\left({\begin{array}{c}1\\5\end{array}}\right),\;\;{\vec {w}}_{1}=\left({\begin{array}{c}4\\2\end{array}}\right),\,\!

we get

{\vec {v}}_{1}+{\vec {w}}_{1}=\left({\begin{array}{c}1\\5\end{array}}\right)+\left({\begin{array}{c}4\\2\end{array}}\right)=\left({\begin{array}{c}5\\7\end{array}}\right).\,\!

Multiplication by a Number

When a vector is multiplied by a number, each component is multiplied by that same number. For example, suppose

{\vec {v}}=\left({\begin{array}{c}v_{1}\\v_{2}\end{array}}\right).

Then

a{\vec {v}}=a\left({\begin{array}{c}v_{1}\\v_{2}\end{array}}\right)=\left({\begin{array}{c}av_{1}\\av_{2}\end{array}}\right).

Notice that if $a$ is positive, then the magnitude of the vector is

|a{\vec {v}}|={\sqrt {(av_{1})^{2}+(av_{1})^{2}}}={\sqrt {a^{2}(v_{1}^{2}+v_{2}^{2})}}=a|{\vec {v}}|.

So multiplying a vector by a number just changes the length, or magnitude, of the vector is $a$ is positive. If $a$ is negative, it changes the directions of the components and therefore of the vector itself and also changes the magnitude.

Products of Two Vectors

Inner Products

The inner product, or dot product, for two real three-dimensional vectors,

{\vec {v}}=v_{x}{\hat {x}}+v_{y}{\hat {y}}+v_{z}{\hat {z}},\;\;{\vec {w}}=w_{x}{\hat {x}}+w_{y}{\hat {y}}+w_{z}{\hat {z}},

can be computed as follows:

{\vec {v}}\cdot {\vec {w}}=v_{x}w_{x}+v_{y}w_{y}+v_{z}w_{z}.

For the inner product of ${\vec {v}}\,\!$ with itself, we get the square of the magnitude of ${\vec {v}}\,\!$ , denoted $|{\vec {v}}|^{2}\,\!$ :

|{\vec {v}}|^{2}={\vec {v}}\cdot {\vec {v}}=v_{x}v_{x}+v_{y}v_{y}+v_{z}v_{z}=v_{x}^{2}+v_{y}^{2}+v_{z}^{2}.

If we want a unit vector in the direction of ${\vec {v}}\,\!$ , we can simply divide it by its magnitude:

{\hat {v}}={\frac {\vec {v}}{|{\vec {v}}|}}.

Now, of course, ${\hat {v}}\cdot {\hat {v}}=1\,\!$ , which can easily be checked.

There are several ways to represent a vector. The ones we will use most often are column and row vector notations. So, for example, we could write the vector above as

{\vec {v}}=\left({\begin{array}{c}v_{x}\\v_{y}\\v_{z}\end{array}}\right).

In this case, our unit vectors are represented by the following:

{\hat {x}}=\left({\begin{array}{c}1\\0\\0\end{array}}\right),\;\;{\hat {y}}=\left({\begin{array}{c}0\\1\\0\end{array}}\right),\;\;{\hat {z}}=\left({\begin{array}{c}0\\0\\1\end{array}}\right).\,\!

We next turn to the subject of complex vectors and the relevant notation. We will see how to compute the inner product later, since some other definitions are required.

Complex Vectors

A complex vector is a vector that has complex entries in each spot in the array. Above, the real vectors had real numbers in the array. Below, (for example the next equation), the vector components are complex numbers. In other words, $v_{x}\,\!$ can be written as $x+iy\,\!$ .

For complex vectors in quantum mechanics, Dirac notation is used most often. This notation uses a $\left\vert \cdot \right\rangle \,\!$ , called a ket, for a vector. So our vector ${\vec {v}}\,\!$ would be

\left\vert v\right\rangle =\left({\begin{array}{c}v_{x}\\v_{y}\\v_{z}\end{array}}\right).

For qubits, i.e. two-state quantum systems, complex vectors will often be used:


	${\begin{aligned}\left\vert \psi \right\rangle &=\left({\begin{array}{c}\alpha \\\beta \end{array}}\right)\\&=\alpha \left\vert 0\right\rangle +\beta \left\vert 1\right\rangle ,\end{aligned}}$	(C.1)

where

\left\vert 0\right\rangle =\left({\begin{array}{c}1\\0\end{array}}\right),\;\;{\mbox{and}}\;\;\left\vert 1\right\rangle =\left({\begin{array}{c}0\\1\end{array}}\right)

are the basis vectors. The two numbers $\alpha \,\!$ and $\beta \,\!$ are complex numbers, so the vector is said to be a complex vector.

Matrices

Basic Definition and Representations

A matrix is an array of numbers of the following form with columns, col. 1, col. 2, etc., and rows, row 1, row 2, etc. The entries for the matrix are labeled by the row and column. So the entry of a matrix $A$ will be $a_{ij}$ where $i$ is the row and $j$ is the column where the number $a_{ij}$ is found. This is how it looks:

A={\begin{array}{c}{\scriptstyle {row\;1}}\\{\scriptstyle {row\;2}}\\\vdots \\{\scriptstyle {row\;m}}\end{array}}{\overset {{\scriptstyle {col.\;1}}\;\;\;{\scriptstyle {col.\;2}}\;\;\;\;{\displaystyle {\cdots }}\;\;\;\;\;{\scriptstyle {col.\;n}}}{\left({\begin{array}{cccc}a_{11}&a_{12}&\cdots &a_{1n}\\a_{21}&a_{22}&\cdots &a_{2n}\\\vdots &\vdots &\ddots &\vdots \\a_{m1}&a_{m2}&\cdots &a_{mn}\end{array}}\right)}}.\,\!

Notice that we represent the whole matrix with a capital letter $A$ . We could also represent it using all of the entries, this array of numbers seen in the equation above. Another way to represent it is to write it as $(a_{ij})$ . By this we mean that it is the array of numbers in the parentheses.

Matrix Addition

Matrix addition is performed by adding each element of one matrix with the corresponding element in another matrix. Let our two matrices be $A$ as above, and $B$ . To represent these in an array,

A=\left({\begin{array}{cccc}a_{11}&a_{12}&\cdots &a_{1n}\\a_{21}&a_{22}&\cdots &a_{2n}\\\vdots &\vdots &\ddots &\vdots \\a_{m1}&a_{m2}&\cdots &a_{mn}\end{array}}\right),\;\;\;\;B=\left({\begin{array}{cccc}b_{11}&b_{12}&\cdots &b_{1n}\\b_{21}&b_{22}&\cdots &b_{2n}\\\vdots &\vdots &\ddots &\vdots \\b_{m1}&b_{m2}&\cdots &b_{mn}\end{array}}\right).\,\!

The the sum, which we could call $C=A+B$ is given by

A+B=\left({\begin{array}{cccc}a_{11}&a_{12}&\cdots &a_{1n}\\a_{21}&a_{22}&\cdots &a_{2n}\\\vdots &\vdots &\ddots &\vdots \\a_{m1}&a_{m2}&\cdots &a_{mn}\end{array}}\right)+\left({\begin{array}{cccc}b_{11}&b_{12}&\cdots &b_{1n}\\b_{21}&b_{22}&\cdots &b_{2n}\\\vdots &\vdots &\ddots &\vdots \\b_{m1}&b_{m2}&\cdots &b_{mn}\end{array}}\right)=\left({\begin{array}{cccc}a_{11}+b_{11}&a_{12}+b_{12}&\cdots &a_{1n}+b_{1n}\\a_{21}+b_{21}&a_{22}+b_{22}&\cdots &a_{2n}+b_{2n}\\\vdots &\vdots &\ddots &\vdots \\a_{m1}+b_{m1}&a_{m2}+b_{m2}&\cdots &a_{mn}+b_{mn}\end{array}}\right).\,\!

In other words, the sum gives $c_{11}=a_{11}+b_{11}\,\!$ , etc. We add them component by component like we do vectors.

Notation

There are many aspects of linear algebra that are quite useful in quantum mechanics. We will briefly discuss several of these aspects here. First, some definitions and properties are provided that will be useful. Some familiarity with matrices will be assumed, although many basic definitions are also included.

Let us denote some $m\times n\,\!$ matrix by $A\,\!$ . The set of all $m\times n\,\!$ matrices with real entries is $M(m\times n,\mathbb {R} )\,\!$ . Such matrices are said to be real since they have all real entries. Similarly, the set of $m\times n\,\!$ complex matrices is $M(m\times n,\mathbb {C} )\,\!$ . For the set of square $n\times n\,\!$ complex matrices, we simply write $M(n,\mathbb {C} )\,\!$ .

We will also refer to the set of matrix elements, $a_{ij}\,\!$ , where the first index ( $i\,\!$ in this case) labels the row and the second $(j)\,\!$ labels the column. Thus the element $a_{23}\,\!$ is the element in the second row and third column. A comma is inserted if there is some ambiguity. For example, in a large matrix the element in the 2nd row and 12th column is written as $a_{2,12}\,\!$ to distinguish between the 21st row and 2nd column.

The Identity Matrix

An identity matrix has the property that when it is multiplied by any matrix, that matrix is unchanged. That is, for any matrix $A\,\!$ ,

\mathbb {I} A=A\mathbb {I} =A.\,\!

Such an identity matrix always has ones along the diagonal and zeroes everywhere else. For example, the $3\times 3\,\!$ identity matrix is

\mathbb {I} =\left({\begin{array}{ccc}1&0&0\\0&1&0\\0&0&1\end{array}}\right).\,\!

It is straight-forward to verify that any $3\times 3\,\!$ matrix is not changed when multiplied by the identity matrix.

Complex Conjugate

The complex conjugate of a matrix is the matrix with each element replaced by its complex conjugate. In other words, to take the complex conjugate of a matrix, one takes the complex conjugate of each entry in the matrix. We denote the complex conjugate with a star, like this: $A^{*}\,\!$ . For example,


	${\begin{aligned}A^{}&=&\left({\begin{array}{ccc}a_{11}&a_{12}&a_{13}\\a_{21}&a_{22}&a_{23}\\a_{31}&a_{32}&a_{33}\end{array}}\right)^{}\\&=&\left({\begin{array}{ccc}a_{11}^{}&a_{12}^{}&a_{13}^{}\\a_{21}^{}&a_{22}^{}&a_{23}^{}\\a_{31}^{}&a_{32}^{}&a_{33}^{*}\end{array}}\right).\end{aligned}}\,\!$	(C.2)

(Notice that the notation for a matrix is a capital letter, whereas the entries are represented by lower case letters.)

Transpose

The transpose of a matrix is the same set of elements, but now the first row becomes the first column, the second row becomes the second column, and so on. Thus the rows and columns are interchanged. For example, for a square $3\times 3\,\!$ matrix, the transpose is given by


	${\begin{aligned}A^{T}&=&\left({\begin{array}{ccc}a_{11}&a_{12}&a_{13}\\a_{21}&a_{22}&a_{23}\\a_{31}&a_{32}&a_{33}\end{array}}\right)^{T}\\&=&\left({\begin{array}{ccc}a_{11}&a_{21}&a_{31}\\a_{12}&a_{22}&a_{32}\\a_{13}&a_{23}&a_{33}\end{array}}\right).\end{aligned}}\,\!$	(C.3)

Hermitian Conjugate

The complex conjugate and transpose of a matrix is called the Hermitian conjugate, or simply the dagger of a matrix. It is called the dagger because the symbol used to denote it, ( $\dagger \,\!$ ):


	$(A^{T})^{}=(A^{})^{T}\equiv A^{\dagger }.\,\!$	(C.4)

For our $3\times 3\,\!$ example,

A^{\dagger }=\left({\begin{array}{ccc}a_{11}^{*}&a_{21}^{*}&a_{31}^{*}\\a_{12}^{*}&a_{22}^{*}&a_{32}^{*}\\a_{13}^{*}&a_{23}^{*}&a_{33}^{*}\end{array}}\right).\,\!

If a matrix is its own Hermitian conjugate, i.e. $A^{\dagger }=A\,\!$ , then we call it a Hermitian matrix. (Clearly this is only possible for square matrices.) Hermitian matrices are very important in quantum mechanics since their eigenvalues are real. (See Sec.(Eigenvalues and Eigenvectors).)

The Inverse of a Matrix

Index Notation

Very often we write the product of two matrices $A\,\!$ and $B\,\!$ simply as $AB\,\!$ and let $C=AB\,\!$ . However, it is also quite useful to write this in component form. In this case, if these are $n\times n\,\!$ matrices, the component form will be

c_{ik}=\sum _{j=1}^{n}a_{ij}b_{jk}.\,\!

This says that the element in the $i^{\mbox{th}}\,\!$ row and $j^{\mbox{th}}\,\!$ column of the matrix $C\,\!$ is the sum $\sum _{j=1}^{n}a_{ij}b_{jk}\,\!$ . The transpose of $C\,\!$ has elements

c_{ki}=\sum _{j=1}^{n}a_{kj}b_{ji}.\,\!

Now if we were to transpose $A\,\!$ and $B\,\!$ as well, this would read

c_{ki}=\sum _{j=1}^{n}(a_{jk})^{T}(b_{ij})^{T}=\sum _{j=1}^{n}b_{ij}^{T}a_{jk}^{T}.\,\!

This gives us a way of seeing the general rule that

C^{T}=B^{T}A^{T}.\,\!

It follows that

C^{\dagger }=B^{\dagger }A^{\dagger }.\,\!

The Trace

The trace of a matrix is the sum of the diagonal elements and is denoted ${\mbox{Tr}}\,\!$ . So for example, the trace of an $n\times n\,\!$ matrix $A\,\!$ is

{\mbox{Tr}}(A)=\sum _{i=1}^{n}a_{ii}\,\!

.

Some useful properties of the trace are the following:

${\mbox{Tr}}(AB)={\mbox{Tr}}(BA)\,\!$
${\mbox{Tr}}(A+B)={\mbox{Tr}}(A)+{\mbox{Tr}}(B)\,\!$ .

Using the first of these results,

{\mbox{Tr}}(UAU^{-1})={\mbox{Tr}}(U^{-1}UA)={\mbox{Tr}}(A).\,\!

This relation is used so often that we state it here explicitly.

The Determinant

For a square matrix, the determinant is quite a useful thing. For example, an $n\times n\,\!$ matrix is invertible if and only if its determinant is not zero. So let us define the determinant and give some properties and examples.

The determinant of a $2\times 2\,\!$ matrix,


	$N=\left({\begin{array}{cc}a&b\\c&d\end{array}}\right),\,\!$	(C.5)

is given by


	$\det(N)=ad-bc.\,\!$	(C.6)

Higher-order determinants can be written in terms of smaller ones in a recursive way. For example, let

$M=\left({\begin{array}{ccc}m_{11}&m_{12}&m_{13}\\m_{21}&m_{22}&m_{23}\\m_{31}&m_{32}&m_{33}\end{array}}\right).\,\!$

Then

$\det(M)=m_{11}\det \left({\begin{array}{cc}m_{22}&m_{23}\\m_{32}&m_{33}\end{array}}\right)-m_{12}\det \left({\begin{array}{cc}m_{21}&m_{23}\\m_{31}&m_{33}\end{array}}\right)+m_{13}\det \left({\begin{array}{cc}m_{21}&m_{22}\\m_{31}&m_{32}\end{array}}\right).\,\!$

The determinant of a matrix $A\,\!$ can be also be written in terms of its components as


	$\det(A)=\sum _{i,j,k,l,...}\epsilon _{ijkl...}a_{1i}a_{2j}a_{3k}a_{4l}...,\,\!$	(C.7)

where the symbol


	$\epsilon _{ijkl...}={\begin{cases}+1,\;{\mbox{if }}\;ijkl...=1234...({\mbox{in order, or any even number of permutations}}),\\-1,\;{\mbox{if }}\;ijkl...=2134...({\mbox{or any odd number of permutations}}),\\\;\;\;0,\;{\mbox{otherwise}},\;({\mbox{meaning any index is repeated}}).\end{cases}}\,\!$	(C.8)

Let us consider the example of the $3\times 3\,\!$ matrix $A\,\!$ given above. The determinant can be calculated by

\det(A)=\sum _{i,j,k}\epsilon _{ijk}a_{1i}a_{2j}a_{3k},\,\!

where, explicitly,


	$\epsilon _{ijk}={\begin{cases}+1,\;{\mbox{if }}\;ijk=123,231,\;{\mbox{or}}\;312,({\mbox{These are even permutations of }}123),\\-1,\;{\mbox{if }}\;ijk=213,132,\;{\mbox{or}}\;321({\mbox{These are odd permuations of }}123),\\\;\;\;0,\;{\mbox{otherwise}},\;({\mbox{meaning any index is repeated}}),\end{cases}}\,\!$	(C.9)

so that


	${\begin{aligned}\det(A)&=&\epsilon _{123}a_{11}a_{22}a_{33}+\epsilon _{132}a_{11}a_{23}a_{32}+\epsilon _{231}a_{12}a_{23}a_{31}\\&&+\epsilon _{213}a_{12}a_{21}a_{33}+\epsilon _{312}a_{13}a_{21}a_{32}+\epsilon _{321}a_{13}a_{22}a_{31}.\end{aligned}}\,\!$	(C.10)

Now given the values of $\epsilon _{ijk}\,\!$ in Eq. C.9, this is

\det(A)=a_{11}a_{22}a_{33}-a_{11}a_{23}a_{32}+a_{12}a_{23}a_{31}-a_{12}a_{21}a_{33}+a_{13}a_{21}a_{32}-a_{13}a_{22}a_{31}.\,\!

The determinant has several properties that are useful to know. A few are listed here:

The determinant of the transpose of a matrix is the same as the determinant of the matrix itself: $\det(A)=\det(A^{T}).\,\!$
The determinant of a product is the product of determinants: $\det(AB)=\det(A)\det(B).\,\!$

From this last property, another specific property can be derived. If we take the determinant of the product of a matrix and its inverse, we find

\det(UU^{-1})=\det(U)\det(U^{-1})=\det(\mathbb {I} )=1,\,\!

since the determinant of the identity is one. This implies that

\det(U^{-1})={\frac {1}{\det(U)}}.\,\!

The Inverse of a Matrix

The inverse of a square matrix $A\,\!$ is another matrix, denoted $A^{-1}\,\!$ , such that

AA^{-1}=A^{-1}A=\mathbb {I} ,\,\!

where $\mathbb {I} \,\!$ is the identity matrix consisting of zeroes everywhere except the diagonal, which has ones. For example, the $3\times 3\,\!$ identity matrix is

\mathbb {I} _{3}=\left({\begin{array}{ccc}1&0&0\\0&1&0\\0&0&1\end{array}}\right).\,\!

It is important to note that a matrix is invertible if and only if its determinant is nonzero. Thus one only needs to calculate the determinant to see if a matrix has an inverse or not.

Hermitian Matrices

Hermitian matrices are important for a variety of reasons; primarily, it is because their eigenvalues are real. Thus Hermitian matrices are used to represent density operators and density matrices, as well as Hamiltonians. The density operator is a positive semi-definite Hermitian matrix (it has no negative eigenvalues) that has its trace equal to one. In any case, it is often desirable to represent $N\times N\,\!$ Hermitian matrices using a real linear combination of a complete set of $N\times N\,\!$ Hermitian matrices. A set of $N\times N\,\!$ Hermitian matrices is complete if any Hermitian matrix can be represented in terms of the set. Let $\{\lambda _{i}\}\,\!$ be a complete set. Then any Hermitian matrix can be represented by $\sum _{i}a_{i}\lambda _{i}\,\!$ . The set can always be taken to be a set of traceless Hermitian matrices and the identity matrix. This is convenient for the density matrix (its trace is one) because the identity part of an $N\times N\,\!$ Hermitian matrix is $(1/N)\mathbb {I} \,\!$ if we take all others in the set to be traceless. For the Hamiltonian, the set consists of a traceless part and an identity part where identity part just gives an overall phase which can often be neglected.

One example of such a set which is extremely useful is the set of Pauli matrices. These are discussed in detail in Chapter 2 and in particular in Section 2.4.

Unitary Matrices

A unitary matrix $U\,\!$ is one whose inverse is also its Hermitian conjugate, $U^{\dagger }=U^{-1}\,\!$ , so that

U^{\dagger }U=UU^{\dagger }=\mathbb {I} .\,\!

If the unitary matrix also has determinant one, it is said to be a special unitary matrix. The set of $n\times n\,\!$ unitary matrices is denoted $U(n)\,\!$ and the set of special unitary matrices is denoted $SU(n)\,\!$ .

Unitary matrices are particularly important in quantum mechanics because they describe the evolution of quantum states. They have this ability due to the fact that the rows and columns of unitary matrices (viewed as vectors) are orthonormal. (This is made clear in an example below.) This means that when they act on a basis vector of the form


	$\left\vert j\right\rangle =\left({\begin{array}{c}0\\0\\\vdots \\1\\\vdots \\0\end{array}}\right),\,\!$	(C.11)

with a single 1, in say the $j$ th spot, and zeroes everywhere else, the result is a normalized complex vector. Acting on a set of orthonormal vectors of the form given in Eq.(C.11) will produce another orthonormal set.

Let us consider the example of a $2\times 2\,\!$ unitary matrix,


	$U=\left({\begin{array}{cc}a&b\\c&d\end{array}}\right).\,\!$	(C.12)

The inverse of this matrix is the Hermitian conjugate,


	$U^{-1}=U^{\dagger }=\left({\begin{array}{cc}a^{}&c^{}\\b^{}&d^{}\end{array}}\right),\,\!$	(C.13)

provided that the matrix $U\,\!$ satisfies the constraints


	${\begin{aligned}\|a\|^{2}+\|b\|^{2}=1,\;&\;ac^{}+bd^{}=0\\ca^{}+db^{}=0,\;&\;\|c\|^{2}+\|d\|^{2}=1,\end{aligned}}\,\!$	(C.14)

and


	${\begin{aligned}\|a\|^{2}+\|c\|^{2}=1,\;&\;ba^{}+dc^{}=0\\b^{}a+d^{}c=0,\;&\;\|b\|^{2}+\|d\|^{2}=1.\end{aligned}}\,\!$	(C.15)

Looking at each row as a vector, the constraints in Eq.(C.14) are the orthonormality conditions for the vectors forming the rows. Similarly, the constraints in Eq.(C.15) are the orthonormality conditions for the vectors forming the columns.

Inner and Outer Products

Now that we have a definition for the Hermitian conjugate, we consider the case for a $1\times n\,\!$ matrix, i.e. a vector. In Dirac notation, this is

\left\vert \psi \right\rangle =\left({\begin{array}{c}\alpha \\\beta \end{array}}\right).\,\!

The Hermitian conjugate comes up so often that we use the following notation for vectors:

\left\langle \psi \right\vert =(\left\vert \psi \right\rangle )^{\dagger }=\left({\begin{array}{c}\alpha \\\beta \end{array}}\right)^{\dagger }=\left(\alpha ^{*},\;\beta ^{*}\right).\,\!

This is a row vector and in Dirac notation is denoted by the symbol $\left\langle \cdot \right\vert \!$ , which is called a bra. Let us consider a second complex vector,

\left\vert \phi \right\rangle =\left({\begin{array}{c}\gamma \\\delta \end{array}}\right).\,\!

The inner product between $\left\vert \psi \right\rangle \,\!$ and $\left\vert \phi \right\rangle \,\!$ is computed as follows:


	${\begin{aligned}\left\langle \phi \mid \psi \right\rangle &\equiv (\left\vert \phi \right\rangle )^{\dagger }\left\vert \psi \right\rangle \\&=(\gamma ^{},\delta ^{})\left({\begin{array}{c}\alpha \\\beta \end{array}}\right)\\&=\gamma ^{}\alpha +\delta ^{}\beta .\end{aligned}}$	(C.16)

The outer product between these same two vectors is

{\begin{aligned}(\left\vert \phi \right\rangle )(\left\vert \psi \right\rangle )^{\dagger }&=\left\vert \phi \right\rangle \left\langle \psi \right\vert \\&=\left({\begin{array}{c}\gamma \\\delta \end{array}}\right)\left({\begin{array}{c}\alpha \\\beta \end{array}}\right)^{\dagger }\\&=\left({\begin{array}{c}\gamma \\\delta \end{array}}\right)\left({\begin{array}{cc}\alpha ^{*}&\beta ^{*}\end{array}}\right)\\&=\left({\begin{array}{cc}\gamma \alpha ^{*}&\gamma \beta ^{*}\\\delta \alpha ^{*}&\delta \beta ^{*}\end{array}}\right)\end{aligned}}\;\!

More Dirac Notation

If these two vectors are orthogonal, then their inner product is zero, or $\left\langle \phi \mid \psi \right\rangle =0\,\!$ . (The $\left\langle \phi \mid \psi \right\rangle \,\!$ is called a bracket, which is the product of the bra and the ket.) The inner product of $\left\vert \psi \right\rangle \,\!$ with itself is

\left\langle \psi \mid \psi \right\rangle =|\alpha |^{2}+|\beta |^{2}.\,\!

This vector is considered normalized when $\left\langle \psi \mid \psi \right\rangle =1\,\!$ .

More generally, we will consider vectors in $N\,\!$ dimensions. In this case we write the vector in terms of a set of basis vectors, $\{\left\vert i\right\rangle \}\,\!$ , where $i=0,1,2,...N-1\,\!$ . This is an ordered set of vectors which are labeled simply by integers. If the set is orthogonal, then

\left\langle i\mid j\right\rangle =0,\;\;{\mbox{for all }}i\neq j.\,\!

If they are normalized, then

\left\langle i\mid i\right\rangle =1,\;\;{\mbox{for all }}i.\,\!

If both of these are true, i.e. the entire set is orthonormal, we can write

\left\langle i\mid j\right\rangle =\delta _{ij}\,\!

where the symbol $\delta _{ij}\,\!$ is called the Kronecker delta and is defined by


	$\delta _{ij}={\begin{cases}1,&{\mbox{if }}i=j,\\0,&{\mbox{if }}i\neq j.\end{cases}}$	(C.17)

Now consider $(N+1)\,\!$ -dimensional vectors by letting two such vectors be expressed in the same basis as

\left\vert \Psi \right\rangle =\sum _{i=0}^{N}\alpha _{i}\left\vert i\right\rangle \,\!

and

\left\vert \Phi \right\rangle =\sum _{j=0}^{N}\beta _{j}\left\vert j\right\rangle .\,\!

Then the inner product is


	${\begin{aligned}\left\langle \Psi \mid \Phi \right\rangle &=\left(\sum _{i=0}^{N}\alpha _{i}\left\vert i\right\rangle \right)^{\dagger }\left(\sum _{j=0}^{N}\beta _{j}\left\vert j\right\rangle \right)\\&=\sum _{ij}\alpha _{i}^{}\beta _{j}\left\langle i\mid j\right\rangle \\&=\sum _{ij}\alpha _{i}^{}\beta _{j}\delta _{ij}\\&=\sum _{i}\alpha _{i}^{*}\beta _{i},\end{aligned}}$	(C.18)

where the fact that the delta function is zero unless $i=j\,\!$ is used to obtain the last equality. Taking the inner product of a vector with itself will get

\left\langle \Psi \mid \Psi \right\rangle =\sum _{i}\alpha _{i}^{*}\alpha _{i}=\sum _{i}|\alpha _{i}|^{2}.\,\!

This immediately gives us a very important property of the inner product. It tells us that, in general,

\left\langle \Phi \mid \Phi \right\rangle \geq 0,\;\;{\mbox{and}}\;\;\left\langle \Phi \mid \Phi \right\rangle =0\Leftrightarrow \left\vert \Phi \right\rangle =0.\,\!

(The symbol $\Leftrightarrow \,\!$ means "if and only if," sometimes written as "iff.")

We could also expand a vector in a different basis. Let us suppose that the set $\{\left\vert e_{k}\right\rangle \}\,\!$ is an orthonormal basis $(\left\langle e_{k}\mid e_{l}\right\rangle =\delta _{kl})\,\!$ that is different from the one considered earlier. We could expand our vector $\left\vert \Psi \right\rangle \,\!$ in terms of our new basis by expanding our new basis in terms of our old basis. Let us first expand the $\left\vert e_{k}\right\rangle \,\!$ in terms of the $\left\vert j\right\rangle \,\!$ :


	$\left\vert e_{k}\right\rangle =\sum _{j}\left\vert j\right\rangle \left\langle j\mid e_{k}\right\rangle ,$	(C.19)

so that


	${\begin{aligned}\left\vert \Psi \right\rangle &=\sum _{j}\alpha _{j}\left\vert j\right\rangle \\&=\sum _{j}\sum _{k}\alpha _{j}\left\vert e_{k}\right\rangle \left\langle e_{k}\mid j\right\rangle \\&=\sum _{k}\alpha _{k}^{\prime }\left\vert e_{k}\right\rangle ,\end{aligned}}$	(C.20)

where


	$\alpha _{k}^{\prime }=\sum _{j}\alpha _{j}\left\langle e_{k}\mid j\right\rangle .$	(C.21)

Notice that the insertion of $\sum _{k}\left\vert e_{k}\right\rangle \left\langle e_{k}\right\vert \,\!$ didn't do anything to our original vector; it is the same vector, just in a different basis. Therefore, this is effectively the identity operator,

\mathbb {I} =\sum _{k}\left\vert e_{k}\right\rangle \left\langle e_{k}\right\vert .\,\!

This is an important and quite useful relation. To interpret Eq.(C.19), we can draw a close analogy with three-dimensional real vectors. The inner product $\left\vert e_{k}\right\rangle \left\langle j\right\vert \,\!$ can be interpreted as the projection of one vector onto another. This provides the part of $\left\vert j\right\rangle \,\!$ along $\left\vert e_{k}\right\rangle \,\!$ .

Transformations

Suppose we have two different orthogonal bases, $\{e_{k}\}\,\!$ , $\{j\}\,\!$ . The numbers $\left\langle e_{k}\mid j\right\rangle \,\!$ for all the different $k\,\!$ and $j\,\!$ are often referred to as matrix elements since the set forms a matrix, with $k\,\!$ labelling the rows and $j\,\!$ labelling the columns. Thus we can represent the transformation from one basis to another with a matrix transformation. Let $M\,\!$ be the matrix with elements $m_{kj}=\left\langle e_{k}\mid j\right\rangle \,\!$ . The transformation from one basis to another, written in terms of the coefficients of $\left\vert \Psi \right\rangle \,\!$ , is


	$A^{\prime }=MA,$	(C.22)

where

A^{\prime }=\left({\begin{array}{c}\alpha _{1}^{\prime }\\\alpha _{2}^{\prime }\\\vdots \\\alpha _{n}^{\prime }\end{array}}\right),\;\;{\mbox{ and }}\;\;A=\left({\begin{array}{c}\alpha _{1}\\\alpha _{2}\\\vdots \\\alpha _{n}\end{array}}\right).\,\!

This sort of transformation is a change of basis. Oftentimes when one vector is transformed to another the transformation can be viewed as a transformation of the components of the vector and is also represented by a matrix. Thus transformations can either be represented by the matrix equation, like Eq.(C.22), or the components,


	$\alpha _{k}^{\prime }=\sum _{j}\alpha _{j}\left\langle e_{k}\mid j\right\rangle =\sum _{j}m_{kj}\alpha _{j}.$	(C.23)

In the case that we consider a matrix transformation of basis elements, we call it a passive transformation. (The transformation does nothing to the object, but only changes the basis in which the object is described.) An active transformation is one where the object itself is transformed. Often these two transformations, active and passive, are very simply related. However, the distinction can be very important.

For a general transformation matrix $T\,\!$ acting on a vector, the matrix elements in a particular basis $\left\vert i\right\rangle \,\!$ are

t_{ij}=\left\langle i\right\vert (T)\left\vert j\right\rangle ,\,\!

just as elements of a vector can be found using

\left\langle i\mid \Psi \right\rangle =\alpha _{i}.\,\!

Transformations of a Qubit

It is worth belaboring the point somewhat and presenting several ways in which to parametrize the set of transformations of a qubit. A qubit state is represented by a complex two-dimensional vector that has been normalized to one:

\left\vert \psi \right\rangle =\alpha _{0}\left\vert 0\right\rangle +\alpha _{1}\left\vert 1\right\rangle =\left({\begin{array}{c}\alpha _{0}\\\alpha _{1}\end{array}}\right),\;\;\;\;|\alpha _{0}|^{2}+|\alpha _{1}|^{2}=1.\,\!

The most general matrix transformation that will take this to any other state of the same form (complex, 2-d vector with unit norm) is a $2\times 2\,\!$ unitary matrix. In Chapter 2, several specific examples of qubit transformations were given; in Chapter 3, Section 3.4 it was stated that an element of SU(2) can be written as (see Section 3.2.1, Exponentiation of a Matrix, in particular Eq. (3.8))


	${\begin{aligned}U(\theta )&=\exp(-i{\vec {n}}\cdot {\vec {\sigma }}\theta /2)\\&=(\mathbb {I} \cos(\theta /2)-i{\vec {n}}\cdot {\vec {\sigma }}\sin(\theta /2))\end{aligned}}$	(C.24)

where ${\vec {n}}\,\!$ is a unit vector, $|{\vec {n}}|=1\,\!$ , and ${\vec {n}}\cdot {\vec {\sigma }}=n_{1}\sigma _{1}+n_{2}\sigma _{2}+n_{3}\sigma _{3}\,\!$ . Explicitly, this is

{\begin{aligned}\exp(-i{\vec {n}}\cdot {\vec {\sigma }}\theta /2)&=\left({\begin{array}{cc}1&0\\0&1\end{array}}\right)\cos(\theta /2)\\&\;\;\;+(-i)\left[n_{1}\left({\begin{array}{cc}0&1\\1&0\end{array}}\right)+n_{2}\left({\begin{array}{cc}0&-i\\i&0\end{array}}\right)+n_{3}\left({\begin{array}{cc}1&0\\0&-1\end{array}}\right)\right]\sin(\theta /2)\\&=\left({\begin{array}{cc}\cos(\theta /2)-in_{3}\sin(\theta /2)&(-in_{1}-n_{2})\sin(\theta /2)\\(-in_{1}+n_{2})\sin(\theta /2)&\cos(\theta /2)+in_{3}\sin(\theta /2)\end{array}}\right).\end{aligned}}

Let us prove this. First, using Eq. (3.7), we will need to find $({\hat {n}}\cdot {\vec {\sigma }})^{m}\,\!$ , i.e., all powers of $({\hat {n}}\cdot {\vec {\sigma }})\,\!$ . This turns out to be fairly easy. First note that ${\hat {n}}\cdot {\hat {n}}=1\,\!$ since ${\hat {n}}\,\!$ is a unit vector. Then note that $\sigma _{i}\sigma _{j}=\mathbb {I} \delta _{ij}+i\epsilon _{ijk}\sigma _{k},$ . (See Eq. (2.21) as well as Eqs. (C.17) and (C.8).) These imply that (recalling that $\sigma _{x}=\sigma _{1},\;\sigma _{y}=\sigma _{2},\;\;\sigma _{z}=\sigma _{3}$ ),

{\begin{aligned}({\vec {n}}\cdot {\vec {\sigma }})^{2}&=(n_{1}\sigma _{1}+n_{2}\sigma _{2}+n_{3}\sigma _{3})^{2}\\&=\left(\sum _{i}n_{i}\sigma _{i}\right)\left(\sum _{j}n_{j}\sigma _{j}\right)\\&=\sum _{ij}n_{i}n_{j}\sigma _{i}\sigma _{j}\\&=\sum _{ij}n_{i}n_{j}(\mathbb {I} \delta _{ij}+i\epsilon _{ijk}\sigma _{k})\\&=\sum _{ij}n_{i}n_{i}\mathbb {I} =\mathbb {I} .\end{aligned}}\,\!

To get the third line, one just uses Eq. (2.21). To see that $\sum _{ij}n_{i}n_{j}\epsilon _{ijk}\sigma _{k}\,\!$ is zero, note that $n_{i}n_{j}\,\!$ is symmetric in $i,j\,\!$ but $\epsilon _{ijk}\,\!$ is antisymmetric in $i,j\,\!$ . Another way to see this more explicitly is to write out the sum and notice that each term shows up with a + and - sign thus cancelling each other out. Therefore, all the even powers of $({\hat {n}}\cdot {\vec {\sigma }})\,\!$ are just equal to the identity matrix and all odd powers are just $({\hat {n}}\cdot {\vec {\sigma }})\,\!$ times the even parts. Thus the sum in Eq. (3.7) reduces to

{\begin{aligned}\exp(-i{\hat {n}}\cdot {\vec {\sigma }}\theta /2)&=\mathbb {I} \cos(\theta /2)+i({\hat {n}}\cdot {\vec {\sigma }})\sin(\theta /2).\;\;\;\;\square \end{aligned}}\,\!

Notice that this is a special unitary matrix. (See Section Unitary Matrices.) To see that this is the most general SU(2) matrix, one needs to verify that any complex $2\times 2\,\!$ unitary matrix can be written in this form. (One way to do this is to start with a generic matrix and impose the restrictions. Here one may simply convince oneself that this is general through observation by acting on basis vectors.) This is the most general qubit transformation and can be interpreted as a rotation about the axis ${\hat {n}}\,\!$ by an angle $\theta \,\!$ .

Another parametrization of this set of matrices is the following, called the Euler angle parametrization:


	$U_{EA}=\exp(-i\sigma _{z}\alpha /2)\exp(-i\sigma _{y}\beta /2)\exp(-i\sigma _{z}\gamma /2).\,\!$	(C.25)

In this case the matrices $\sigma _{z}\,\!$ and $\sigma _{y}\,\!$ are not unique. Any two of the three Pauli matrices (or one of each) may be chosen. This is quite simple, useful, and generalizable to SU(N) for N arbitrary. In the simple case of a qubit, one may convince oneself by acting on basis vectors as before. However, with a little thought, one may see that rotating to a position on the sphere by the first angle, followed by rotations using the other two, will provide for a general orientation of an object.

Similarity Transformation

A similarity transformation of an $n\times n\,\!$ matrix $A\,\!$ by an invertible matrix $S\,\!$ is $SAS^{-1}\,\!$ . There are (at least) two important things to note about similarity transformations:

Similarity transformations leave the trace of a matrix unchanged. This is shown explicitly in Section 3.5.
Similarity transformations leave the determinant of a matrix unchanged, or invariant. This is because $\det(SAS^{-1})=\det(S)\det(A)\det(S^{-1})=\det(S)\det(A){\frac {1}{\det(S)}}=\det(A).\,\!$
Simultaneous similarity transformations of matrices in an equation will leave the equation unchanged. Let $A^{\prime }=SAS^{-1}\,\!$ , $B^{\prime }=SBS^{-1}\,\!$ , and $C^{\prime }=SCS^{-1}\,\!$ . If $AB=C\,\!$ , then $A^{\prime }B^{\prime }=C^{\prime }\,\!$ , since $A^{\prime }B^{\prime }=SAS^{-1}SBS^{-1}=SABS^{-1}=SCS^{-1}=C^{\prime }\,\!$ . The two matrices $A^{\prime }\,\!$ and $A\,\!$ are said to be similar.

Polar Decomposition and Singular Value Decomposition

A decomposition of a matrix that is often useful is the polar decomposition. For any matrix, $A\,\!$ , there exists a unitary matrix $U\,\!$ and postive matrices $K\,\!$ and $L\,\!$ such that


	$A=UK=LU,\,\!$	(C.26)

where the postive operators $K\,\!$ and $L\,\!$ are $K\equiv {\sqrt {A^{\dagger }A}}\,\!$ and $L\equiv {\sqrt {AA^{\dagger }}}\,\!$ . Futhermore, if $A\,\!$ is invertible, then $U\,\!$ is unique.

A matrix can always be diagonalized. By this, it is meant that for every complex matrix $M\,\!$ there is a diagonal matrix $D\,\!$ such that


	$M=UDV,\,\!$	(C.27)

where $U\,\!$ and $V\,\!$ are unitary matrices. This form is called a singular value decomposition of the matrix and the entries of the diagonal matrix $D\,\!$ are called the singular values of the matrix $M\,\!$ . However, the singular values are not always easy to find.

For proofs, see NielsenChuang:book.

Eigenvalues and Eigenvectors

Diagonalization

For the special case that the matrix $M\,\!$ is Hermitian $(M^{\dagger }=M)\,\!$ , the matrix $M\,\!$ can be written as


	$M=UDU^{\dagger }$	(C.28)

where $U\,\!$ is unitary $(U^{-1}=U^{\dagger })\,\!$ . In this case the elements of the matrix $D\,\!$ are called eigenvalues. Very often eigenvalues are introduced as solutions to the equation

M\left\vert v\right\rangle =\lambda \left\vert v\right\rangle \,\!

where $\left\vert v\right\rangle \,\!$ is an eigenvector.

To find the eigenvalues and eigenvectors of a matrix $M\,\!$ , we follow a standard procedure which is to calculate


	$\det(\lambda \mathbb {I} -M)=0$	(C.29)

and then solve for $\lambda \,\!$ . The different solutions for $\lambda \,\!$ is the set of eigenvalues and is called the spectrum. Let the different eigenvalues be denoted by $\lambda _{i}\,\!$ , $i=1,2,...,n\,\!$ fo an $n\times n\,\!$ vector. If two eigenvalues are equal, we say the spectrum is degenerate. To find the eigenvectors, which correspond to different eigenvalues, the equation

M\left\vert v\right\rangle =\lambda _{i}\left\vert v\right\rangle \,\!

must be solved for each value of $i\,\!$ . Notice that this equation holds even if we multiply both sides by some complex number. This implies that an eigenvector can always be scaled. Usually they are normalized to obtain an orthonormal set. As we will see by example, degenerate eigenvalues require some care.

Example 1

Consider a $2\times 2\,\!$ Hermitian matrix


	$\sigma =\left({\begin{array}{cc}1+a&b-ic\\b+ic&1-a\end{array}}\right).$	(C.30)

To find the eigenvalues of this, we follow a standard procedure, which is to calculate


	$\det(\sigma -\lambda \mathbb {I} )=0,$	(C.31)

and solve for $\lambda \,\!$ . The eigenvalues of this matrix are given by

\det \left({\begin{array}{cc}1+a-\lambda &b-ic\\b+ic&1-a-\lambda \end{array}}\right)=0,\,\!

which implies that the eigenvalues are

\lambda _{\pm }=1\pm {\sqrt {a^{2}+b^{2}+c^{2}}}\,\!

and the eigenvectors are

v_{1}=\left({\begin{array}{c}i\left(-a+c+{\sqrt {a^{2}+4b^{2}-2ac+c^{2}}}\right)\\2b\end{array}}\right),v_{2}=\left({\begin{array}{c}i\left(-a+c-{\sqrt {a^{2}+4b^{2}-2ac+c^{2}}}\right)\\2b\end{array}}\right).\,\!

These expressions are useful for calculating properties of qubit states as will be seen in the text.

Example 2

Now consider a $3\times 3\,\!$ matrix,

N=\left({\begin{array}{ccc}1&-i&0\\i&1&0\\0&0&1\end{array}}\right).\,\!

First we calculate

\det \left({\begin{array}{ccc}1-\lambda &-i&0\\i&1-\lambda &0\\0&0&1-\lambda \end{array}}\right)=(1-\lambda )[(1-\lambda )^{2}-1].\,\!

This implies that the eigenvalues C.6 are

\lambda =1,0,{\mbox{ or }}2.\,\!

Let $\lambda _{1}=1\,\!$ , $\lambda _{0}=0\,\!$ , and $\lambda _{2}=2\,\!$ . To find eigenvectors, we calculate


	${\begin{aligned}Nv&=\lambda v,\\\left({\begin{array}{ccc}1&-i&0\\i&1&0\\0&0&1\end{array}}\right)\left({\begin{array}{c}v_{1}\\v_{2}\\v_{3}\end{array}}\right)&=\lambda \left({\begin{array}{c}v_{1}\\v_{2}\\v_{3}\end{array}}\right)\end{aligned}}$	(C.32)

for each $\lambda \,\!$ . For $\lambda =1\,\!$ , we get the following equations:


	${\begin{aligned}v_{1}-iv_{2}&=v_{1},\\iv_{1}+v_{2}&=v_{2},\\v_{3}&=v_{3}.\end{aligned}}$	(C.33)

Solving this obtains $v_{2}=0\,\!$ , $v_{1}=0\,\!$ , and $v_{3}\,\!$ is any non-zero number (which will be chosen to normalize the vector). For $\lambda =0\,\!$ ,


	${\begin{aligned}v_{1}&=iv_{2},\\v_{3}&=0.\end{aligned}}$	(C.34)

And finally, for $\lambda =2\,\!$ , we obtain


	${\begin{aligned}v_{1}-iv_{2}&=2v_{1},\\iv_{1}+v_{2}&=2v_{2},\\v_{3}&=2v_{3},\end{aligned}}$	(C.35)

so that $v_{1}=-iv_{2}\,\!$ . Therefore, our three eigenvectors are

v_{0}={\frac {1}{\sqrt {2}}}\left({\begin{array}{c}i\\1\\0\end{array}}\right),\;v_{1}=\left({\begin{array}{c}0\\0\\1\end{array}}\right),\;v_{2}={\frac {1}{\sqrt {2}}}\left({\begin{array}{c}-i\\1\\0\end{array}}\right).\,\!

The matrix

V=(v_{0},v_{1},v_{2})=\left({\begin{array}{ccc}i/{\sqrt {2}}&0&-i/{\sqrt {2}}\\1/{\sqrt {2}}&0&1/{\sqrt {2}}\\0&1&0\end{array}}\right)\,\!

is the matrix that diagonalizes $N\,\!$ in the following way:

N=VDV^{\dagger }\,\!

where

D=\left({\begin{array}{ccc}0&0&0\\0&1&0\\0&0&2\end{array}}\right).\,\!

We may write this as

V^{\dagger }NV=D.\,\!

This is sometimes called the eigenvalue decompostion of the matrix and can also be written as


	$N=\sum _{i}\lambda _{i}v_{i}v_{i}^{\dagger }.\,\!$	(C.36)

Example 3

Next, consider the complex $3\times 3$ Hermitian matrix

M=\left({\begin{array}{ccc}{\frac {5}{2}}&0&{\frac {i}{2}}\\0&2&0\\-{\frac {i}{2}}&0&{\frac {5}{2}}\end{array}}\right).\,\!

First we calculate

\det \left({\begin{array}{ccc}{\frac {5}{2}}-\lambda &0&{\frac {i}{2}}\\0&2-\lambda &0\\-{\frac {i}{2}}&0&{\frac {5}{2}}-\lambda \end{array}}\right)=(2-\lambda )\left[\left({\frac {5}{2}}-\lambda \right)^{2}-{\frac {1}{4}}\right].\,\!

This implies that the eigenvalues C.6 are

\lambda =2,2,{\mbox{ or }}3.\,\!

Note that there are two that are the same, or degenerate. Let $\lambda _{1}=2\,\!$ , $\lambda _{2}=2\,\!$ , and $\lambda _{3}=3\,\!$ . To find eigenvectors, we calculate


	${\begin{aligned}Mv&=\lambda v,\\\left({\begin{array}{ccc}{\frac {5}{2}}&0&{\frac {i}{2}}\\0&2&0\\-{\frac {i}{2}}&0&{\frac {5}{2}}\end{array}}\right)\left({\begin{array}{c}v_{1}\\v_{2}\\v_{3}\end{array}}\right)&=\lambda \left({\begin{array}{c}v_{1}\\v_{2}\\v_{3}\end{array}}\right)\end{aligned}}$	(C.37)

for each $\lambda \,\!$ . For $\lambda =3\,\!$ , we get the following equations:


	${\begin{aligned}{\frac {5}{2}}v_{1}+{\frac {i}{2}}v_{3}&=3v_{1},\\2v_{2}&=3v_{2},\\-{\frac {i}{2}}v_{1}+{\frac {5}{2}}v_{3}&=3v_{3},\end{aligned}}$	(C.38)

so


	${\begin{aligned}iv_{3}&=v_{1},\\v_{2}&=0.\end{aligned}}$	(C.39)

Now for $\lambda =2\,\!$ ,


	${\begin{aligned}{\frac {5}{2}}v_{1}+{\frac {i}{2}}v_{3}&=2v_{1},\\2v_{2}&=2v_{2},\\-{\frac {i}{2}}v_{1}+{\frac {5}{2}}v_{3}&=2v_{3},\end{aligned}}$	(C.40)

so


	${\begin{aligned}v_{3}&=iv_{1},\\v_{2}&={\mbox{anything}}.\end{aligned}}$	(C.41)

We would like to have a set of orthonormal vectors. (We can always choose the set to be orthonormal.) We choose the three eigenvectors to be

v_{2}=\left({\begin{array}{c}1\\a\\i\end{array}}\right),\;\;v_{2}^{\prime }=\left({\begin{array}{c}1\\a^{\prime }\\i\end{array}}\right),\;\;v_{3}=\left({\begin{array}{c}i\\0\\1\end{array}}\right).\,\!

We set the inner product of the two vectors $v_{2}\,\!$ and $v_{2}^{\prime }\,\!$ equal to zero so as to have then be orthogonal:

1+aa^{\prime }+1=2+aa^{\prime }=0.\,\!

Now we can choose $a={\sqrt {2}}\,\!$ and $a^{\prime }=-{\sqrt {2}}\,\!$ so that the normalized eigenvectors are

v_{2}={\frac {1}{2}}\left({\begin{array}{c}1\\{\sqrt {2}}\\i\end{array}}\right),\;\;v_{2}^{\prime }={\frac {1}{2}}\left({\begin{array}{c}1\\-{\sqrt {2}}\\i\end{array}}\right),\;\;v_{3}={\frac {1}{\sqrt {2}}}\left({\begin{array}{c}i\\0\\1\end{array}}\right).\,\!

Tensor Products

The tensor product, or the Kronecker product, is used extensively in quantum mechanics and throughout the course. It is commonly denoted with a $\otimes \,\!$ symbol, although this is often left out. In fact, the following are commonly found in the literature as notation for the tensor product of two vectors $\left\vert \Psi \right\rangle \,\!$ and $\left\vert \Phi \right\rangle \,\!$ :


	${\begin{aligned}\left\vert \Psi \right\rangle \otimes \left\vert \Phi \right\rangle &=\left\vert \Psi \right\rangle \left\vert \Phi \right\rangle \\&=\left\vert \Psi \Phi \right\rangle .\end{aligned}}$	(C.42)

Each of these has its advantages and will all be used in different circumstances in this text.

The tensor product is also often used for operators. Several examples will be given, one that explicitly calculates the tensor product for two vectors and one that calculates it for two matrices which could represent operators. However, these are not different in the sense that a vector is a $1\times n\,\!$ or an $n\times 1\,\!$ matrix. It is also noteworthy that the two objects in the tensor product need not be of the same type. In general, a tensor product of an $n\times m\,\!$ object (array) with a $p\times q\,\!$ object will produce an $np\times mq\,\!$ object.

The tensor product of two objects is computed as follows. Let $A\,\!$ be an $n\times m\,\!$ and $B\,\!$ be a $p\times q\,\!$ array,


	$A=\left({\begin{array}{cccc}a_{11}&a_{12}&\cdots &a_{1m}\\a_{21}&a_{22}&\cdots &a_{2m}\\\vdots &&\ddots &\\a_{n1}&a_{n2}&\cdots &a_{nm}\end{array}}\right),$	(C.43)

and similarly for $B\,\!$ . Then


	$A\otimes B=\left({\begin{array}{cccc}a_{11}B&a_{12}B&\cdots &a_{1m}B\\a_{21}B&a_{22}B&\cdots &a_{2m}B\\\vdots &&\ddots &\\a_{n1}B&a_{n2}B&\cdots &a_{nm}B\end{array}}\right).$	(C.44)

Let us now consider two examples. First let $\left\vert \phi \right\rangle \,\!$ and $\left\vert \psi \right\rangle \,\!$ be as before,

\left\vert \psi \right\rangle =\left({\begin{array}{c}\alpha \\\beta \end{array}}\right)\;\;{\mbox{and}}\;\;\left\vert \phi \right\rangle =\left({\begin{array}{c}\gamma \\\delta \end{array}}\right).\,\!

Then


	${\begin{aligned}\left\vert \psi \right\rangle \otimes \left\vert \phi \right\rangle &=\left({\begin{array}{c}\alpha \\\beta \end{array}}\right)\otimes \left({\begin{array}{c}\gamma \\\delta \end{array}}\right)\\&=\left({\begin{array}{c}\alpha \gamma \\\alpha \delta \\\beta \gamma \\\beta \delta \end{array}}\right).\end{aligned}}$	(C.45)

Also


	${\begin{aligned}\left\vert \psi \right\rangle \otimes \left\langle \phi \right\vert &=\left\vert \psi \right\rangle \left\langle \phi \right\vert \\&=\left({\begin{array}{c}\alpha \\\beta \end{array}}\right)\otimes \left({\begin{array}{cc}\gamma ^{}&\delta ^{}\end{array}}\right)\\&=\left({\begin{array}{cc}\alpha \gamma ^{}&\alpha \delta ^{}\\\beta \gamma ^{}&\beta \delta ^{}\end{array}}\right).\end{aligned}}$	(C.46)

Now consider two matrices

A=\left({\begin{array}{cc}a&b\\c&d\end{array}}\right)\;\;{\mbox{and}}\;\;B=\left({\begin{array}{cc}e&f\\g&h\end{array}}\right).\,\!

Then


	${\begin{aligned}A\otimes B&=\left({\begin{array}{cc}a&b\\c&d\end{array}}\right)\otimes \left({\begin{array}{cc}e&f\\g&h\end{array}}\right)\\&=\left({\begin{array}{cccc}ae&af&be&bf\\ag&ah&bg&bh\\ce&cf&de&df\\cg&ch&dg&dh\end{array}}\right).\end{aligned}}$	(C.47)

Properties of Tensor Products

Listed here are properties of tensor products that are useful, with $A\,\!$ , $B\,\!$ , $C\,\!$ , $D\,\!$ of any type:

$(A\otimes B)(C\otimes D)=AC\otimes BD\,\!$
$(A\otimes B)^{T}=A^{T}\otimes B^{T}\,\!$
$(A\otimes B)^{*}=A^{*}\otimes B^{*}\,\!$
$(A\otimes B)\otimes C=A\otimes (B\otimes C)\,\!$
$(A+B)\otimes C=A\otimes C+B\otimes C\,\!$
$A\otimes (B+C)=A\otimes B+A\otimes C\,\!$
${\mbox{Tr}}(A\otimes B)={\mbox{Tr}}(A){\mbox{Tr}}(B)\,\!$
If $A\,\!$ is an $n\times n\,\!$ matrix, and $B\,\!$ is an $m\times m\,\!$ matrix, then ${\mbox{det}}(A\otimes B)=({\mbox{det}}(A))^{m}({\mbox{det}}(B))^{n}\,\!$ .

(See Horn and Johnson, Topics in Matrix Analysis [10], Chapter 4.)

Exercises

Vectors

About the author
Foreword to the first edition
Foreword to the second edition

Matrices

About the author
Foreword to the first edition
Foreword to the second edition

Dirac Notation (bras and kets)

About the author
Foreword to the first edition
Foreword to the second edition

Transformations

About the author
Foreword to the first edition
Foreword to the second edition

Eigenvalues and Eigenvectors

About the author
Foreword to the first edition
Foreword to the second edition

Tensor Products

About the author
Foreword to the first edition
Foreword to the second edition

Appendix C - Vectors and Linear Algebra

Contents

Introduction

Vectors

Vectors Defining and Representing

Examples of Vectors

Real Vectors

The Magnitude or Length of a Vector

Vector Operations

Vector Addition

Example

Multiplication by a Number

Products of Two Vectors

Inner Products

Complex Vectors

Matrices

Basic Definition and Representations

Matrix Addition

Notation

The Identity Matrix

Complex Conjugate

Transpose

Hermitian Conjugate

The Inverse of a Matrix

Index Notation

The Trace

The Determinant

The Inverse of a Matrix

Hermitian Matrices

Unitary Matrices

Inner and Outer Products

More Dirac Notation

Transformations

Transformations of a Qubit

Similarity Transformation

Polar Decomposition and Singular Value Decomposition

Eigenvalues and Eigenvectors

Diagonalization

Example 1

Example 2

Example 3

Tensor Products

Properties of Tensor Products

Exercises

Navigation menu

Search