Introduction
It was a great realization that information is physical and that a
(classical) Turing machine is not the end of the story of
computation.  The physical system in which the information is stored
and manipulated is important and qubits are quite different from
bits.  
In this chapter, some background in quantum mechanics is provided.
Not all of this chapter will be directly relevant to our discussion,
but it is included to advance our understanding
of how textbook quantum mechanics is related to quantum
computing.  The connection is clear, but the story seems
incomplete from a physicist's perspective.  For the subject of error
prevention methods, some of this chapter will be vital, in
particular the section(s) concerning the density matrix.  Not only
is this material vital, it is often not covered in quantum mechanics
classes, both undergraduate and graduate.  
It is also worth emphasizing that this chapter is primarily aimed at
physicists and at others who are interested in the background
physics.  It is not, however, necessary for much of what follows.
Schrodinger's Equation
A common starting point in quantum mechanics is Schrodinger's equation.  This equation is not derived or justified here, but is given in a general form:
$$ i\hbar\frac{\partial}{\partial t}\left\vert\psi\right\rangle = H\left\vert\psi\right\rangle, \tag{3.1} $$
where $H$ is the Hamiltonian, $\hbar$ is Planck's constant (divided by $2\pi$), and $t$ is time.  The Hamiltonian contains what
is known about the system's evolution.  
Most of the time in these notes, we let $\hbar = 1$.
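This equation is (formally) solved by taking the time derivative to be
an ordinary derivative (we assume no explicit time dependence for
$H$), so
$$ \frac{d\left\vert\psi\right\rangle}{dt} = -\frac{i}{\hbar}H\left\vert\psi\right\rangle. \tag{3.2} $$
This means that 
$$ d\left\vert\psi\right\rangle = -\frac{i}{\hbar}H\left\vert\psi\right\rangle\,dt, \tag{3.3} $$
so
$$ \left\vert\psi(t)\right\rangle = e^{-iHt/\hbar}\left\vert\psi(0)\right\rangle. \tag{3.4} $$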
Now if $H$ is Hermitian (it is), then the matrix 
$$ U(t) = e^{-iHt/\hbar} \tag{3.5} $$
is unitary.  
(If this is unclear, see Appendix C - Vectors and Linear Algebra, in particular the section entitled Unitary Matrices.)  Any
transformation on a closed system can be described by a unitary
transformation and any unitary transformation can be obtained by the
exponentiation of a Hermitian matrix.  
The end result and important point is that the evolution of a quantum
state is, in general, given by a unitary matrix
$$ \left\vert\psi(t)\right\rangle = U(t)\left\vert\psi(0)\right\rangle. \tag{3.6} $$
So our objective in quantum information processing is to create a
unitary evolution, and eventual measurement, which will produce a
particular outcome.
Exponentiating a Matrix
 Aside: a note about the exponentiation of a matrix.
It may seem strange to exponentiate a matrix.  However, you can define
a function of a matrix according to its Taylor expansion.  The details
of this are primarily unimportant here, but for demonstration purposes,
it is written out.  
The Taylor expansion of an exponential is the following:
$$ e^{x} = \sum_{n=0}^{\infty}\frac{x^{n}}{n!} = 1 + x + \frac{x^{2}}{2!} + \frac{x^{3}}{3!} + \cdots \tag{3.7} $$
and this can be used to exponentiate a matrix by letting the matrix
replace $x$ in the equation (with the identity matrix in place of 1).  This can also be used to prove that 
$$ e^{i\theta} = \cos(\theta) + i\sin(\theta). \tag{3.8} $$
End Aside
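As a rough numerical check of the aside, the sketch below sums a truncated Taylor series for a matrix argument and compares the result with the closed form $\cos(\theta)\,\mathbb{I} - i\sin(\theta)\,\sigma_x$. The helper names (`expm_taylor`, `max_abs_diff`), the choice of $\sigma_x$ as the Hermitian matrix, the angle, and the truncation at 30 terms are all illustrative assumptions, not part of the original notes.

```python
import math

def matmul(a, b):
    """Multiply two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(a):
    """Conjugate transpose of a square matrix."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]

def expm_taylor(m, terms=30):
    """Approximate exp(m) with the first `terms` terms of the Taylor series."""
    n = len(m)
    identity = [[complex(i == j) for j in range(n)] for i in range(n)]
    total = [row[:] for row in identity]   # the n = 0 term
    term = [row[:] for row in identity]
    for k in range(1, terms):
        # term becomes m^k / k! by multiplying the previous term by m / k
        term = [[x / k for x in row] for row in matmul(term, m)]
        total = [[total[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return total

def max_abs_diff(a, b):
    """Largest entry-wise difference between two square matrices."""
    n = len(a)
    return max(abs(a[i][j] - b[i][j]) for i in range(n) for j in range(n))

theta = 0.7                                   # illustrative angle
sigma_x = [[0, 1], [1, 0]]
generator = [[-1j * theta * x for x in row] for row in sigma_x]  # -i * theta * sigma_x

U = expm_taylor(generator)
closed_form = [[math.cos(theta), -1j * math.sin(theta)],
               [-1j * math.sin(theta), math.cos(theta)]]

series_error = max_abs_diff(U, closed_form)
unitarity_error = max_abs_diff(matmul(dagger(U), U), [[1, 0], [0, 1]])
print(series_error, unitarity_error)
```

Both printed numbers should be at the level of floating-point rounding, which illustrates that exponentiating $-i$ times a Hermitian matrix gives a unitary matrix.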
Density Matrix for Pure States
Now let us consider the object (a density matrix, or 
density operator, of rank one) 
$$ \rho = \left\vert\psi\right\rangle\left\langle\psi\right\vert, \tag{3.9} $$
which is just the outer product of two vectors.  (See Appendix C, Sec. C.2.4.) 
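Since $i\frac{\partial}{\partial t}\left\vert\psi\right\rangle = H\left\vert\psi\right\rangle$ (with $\hbar = 1$), $-i\frac{\partial}{\partial t}\left\langle\psi\right\vert = \left\langle\psi\right\vert H$ is also true.  If we
differentiate $\rho$ with respect to $t$, we discover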
$$ \begin{aligned}\frac{\partial\rho}{\partial t} &= \left(\frac{\partial\left\vert\psi\right\rangle}{\partial t}\right)\left\langle\psi\right\vert + \left\vert\psi\right\rangle\left(\frac{\partial\left\langle\psi\right\vert}{\partial t}\right)\\ &= (-iH)\rho + \rho(iH) = -i[H,\rho].\end{aligned} \tag{3.10} $$
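This is merely the Schrodinger equation for a density matrix with the solution
$$ \rho(t) = U(t)\rho(0)U^{\dagger}(t). \tag{3.11} $$
This follows from $\left\vert\psi(t)\right\rangle = U(t)\left\vert\psi(0)\right\rangle$. 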
Consider our two-state system, 
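$$ \left\vert 0\right\rangle = \left(\begin{array}{c}1\\0\end{array}\right), \qquad \left\vert 1\right\rangle = \left(\begin{array}{c}0\\1\end{array}\right). \tag{3.12} $$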
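Recall that the arbitrary superposition of these states is shown by 
$$ \left\vert\psi\right\rangle = \alpha_0\left\vert 0\right\rangle + \alpha_1\left\vert 1\right\rangle = \left(\begin{array}{c}\alpha_0\\ \alpha_1\end{array}\right), \tag{3.13} $$
where $\alpha_0$ and $\alpha_1$ are complex numbers such that $|\alpha_0|^2 + |\alpha_1|^2 = 1$.  The corresponding 
pure state (i.e. rank one) density matrix is given by 
$$ \rho = \left\vert\psi\right\rangle\left\langle\psi\right\vert = \left(\begin{array}{cc}|\alpha_0|^2 & \alpha_0\alpha_1^*\\ \alpha_0^*\alpha_1 & |\alpha_1|^2\end{array}\right). \tag{3.14} $$
Note that the superposition in Eq.(3.13) can be obtained from any pure state by a unitary transformation.  The trace of
the density matrix is an important quantity; here it is
$$ \mathrm{Tr}(\rho) = |\alpha_0|^2 + |\alpha_1|^2 = 1. \tag{3.15} $$
Notice also that the determinant of this matrix is zero, indicating that it has a zero eigenvalue:
$$ \det(\rho) = |\alpha_0|^2|\alpha_1|^2 - (\alpha_0\alpha_1^*)(\alpha_0^*\alpha_1) = 0. \tag{3.16} $$
To see this another way, note that the density operator of rank one can be written as $\rho = U\left\vert 0\right\rangle\left\langle 0\right\vert U^{\dagger}$, so that the determinant is 
$$ \det(\rho) = \det(U)\,\det(\left\vert 0\right\rangle\left\langle 0\right\vert)\,\det(U^{\dagger}) = 0. \tag{3.17} $$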
This is a characteristic of a pure state, and for two-state systems it is a necessary and sufficient condition for the density operator to represent a pure state of the system.
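The properties just listed for a rank-one density matrix can be checked numerically. The following is a minimal sketch in which the amplitudes $0.6$ and $0.8i$ and the helper name `pure_density_matrix` are illustrative assumptions; any normalized pair of amplitudes would serve.

```python
import math

def matmul(a, b):
    """Multiply two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def pure_density_matrix(psi):
    """Outer product |psi><psi| for a state given as a list of amplitudes."""
    return [[complex(x) * complex(y).conjugate() for y in psi] for x in psi]

# Illustrative amplitudes with |alpha0|^2 + |alpha1|^2 = 1
alpha0, alpha1 = 0.6, 0.8j
rho = pure_density_matrix([alpha0, alpha1])

trace = rho[0][0] + rho[1][1]
determinant = rho[0][0] * rho[1][1] - rho[0][1] * rho[1][0]

# For a pure state rho^2 should reproduce rho
rho_squared = matmul(rho, rho)
idempotency_error = max(abs(rho_squared[i][j] - rho[i][j])
                        for i in range(2) for j in range(2))

print(trace, determinant, idempotency_error)
```

The trace should come out as one and the determinant as zero, up to rounding, in line with the statement that a two-state pure density matrix has eigenvalues one and zero.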
Measurements Revisited
If the state of a quantum system is described by
$$ \left\vert\psi\right\rangle = \alpha_0\left\vert 0\right\rangle + \alpha_1\left\vert 1\right\rangle, \tag{3.18} $$
then the probability of finding it in the state $\left\vert 0\right\rangle$ when measured in
the computational basis is $|\alpha_0|^2$.  However, this is a
particular superposition that could be written as 
$$ \left\vert\psi\right\rangle = U\left\vert 0\right\rangle. \tag{3.19} $$
In the section entitled Schrodinger's Equation it was shown that this matrix $U$ results
from the exponentiation of a Hermitian matrix. Recall from the section entitled The Pauli Matrices that any $2\times 2$ Hermitian matrix can be written in terms of the Pauli matrices (and the identity, which contributes only an overall phase to $U$).  To make this explicit using standard conventions, 
$$ U = \exp(-i\vec{n}\cdot\vec{\sigma}\,\theta), \tag{3.20} $$
where $\vec{n}$ is a unit vector, $\vec{\sigma} = (\sigma_1,\sigma_2,\sigma_3)$ and $\vec{n}\cdot\vec{\sigma} = n_1\sigma_1 + n_2\sigma_2 + n_3\sigma_3$. 
One can write this matrix out explicitly, 
$$ \begin{aligned}\exp(-i\vec{n}\cdot\vec{\sigma}\theta) &= \left(\begin{array}{cc}1&0\\0&1\end{array}\right)\cos(\theta)\\ &\;\;\;+(-i)\left[n_{1}\left(\begin{array}{cc}0&1\\1&0\end{array}\right)+n_{2}\left(\begin{array}{cc}0&-i\\i&0\end{array}\right)+n_{3}\left(\begin{array}{cc}1&0\\0&-1\end{array}\right)\right]\sin(\theta)\\ &= \left(\begin{array}{cc}\cos(\theta)-in_{3}\sin(\theta)&(-in_{1}-n_{2})\sin(\theta)\\(-in_{1}+n_{2})\sin(\theta)&\cos(\theta)+in_{3}\sin(\theta)\end{array}\right).\end{aligned} \tag{3.21} $$
Notice this is a $2\times 2$ special unitary matrix.  (See Appendix C - Vectors and Linear Algebra, in particular the subsection Unitary Matrices.)
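To see that any state $\left\vert\psi\right\rangle = \alpha_0\left\vert 0\right\rangle + \alpha_1\left\vert 1\right\rangle$ for arbitrary coefficients
$\alpha_0$, $\alpha_1$ can be obtained by choosing $\vec{n}$ and $\theta$
appropriately, the state $\left\vert 0\right\rangle$ can be chosen as a starting point.  
Then, 
$$ U\left\vert 0\right\rangle = \left(\cos(\theta) - in_{3}\sin(\theta)\right)\left\vert 0\right\rangle + (-in_{1}+n_{2})\sin(\theta)\left\vert 1\right\rangle. \tag{3.22} $$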
For example, choosing $\theta = 0$ gives the original state; choosing
$\theta = \pi/2$ and $\vec{n} = (0,1,0)$ gives $\left\vert 1\right\rangle$; and choosing
$\theta = \pi/4$ and $\vec{n} = (0,1,0)$ gives an equal superposition.  
In general, when the system is in the state $U\left\vert 0\right\rangle$,
the probability of finding the state $\left\vert 0\right\rangle$ when a measurement is made in the computational basis is given by 
$$ P(0) = \left|\cos(\theta) - in_{3}\sin(\theta)\right|^2 = \cos^2(\theta) + n_{3}^2\sin^2(\theta), \tag{3.23} $$
and the probability of finding $\left\vert 1\right\rangle$ is
$$ P(1) = \left|(-in_{1}+n_{2})\sin(\theta)\right|^2 = (n_{1}^2+n_{2}^2)\sin^2(\theta). \tag{3.24} $$
Notice that the probabilities add up to one if $\vec{n}$ is a unit vector.  
What this shows is that there is a transformation that takes the state
$\left\vert 0\right\rangle$, which has probability $1$ of being in the state $\left\vert 0\right\rangle$ and
probability $0$ of being in the state $\left\vert 1\right\rangle$, and transforms it
(using a "rotation") into a state with a different (and generic)
probability of each.  This means that the density matrix corresponding
to this system always has determinant zero, meaning that (for a two-state system) it has one
eigenvalue 1 and another eigenvalue 0.  (The determinant is the
product of the eigenvalues.)
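The measurement probabilities obtained by rotating $\left\vert 0\right\rangle$ can be explored numerically. The sketch below builds the explicit matrix of Eq.(3.21) and reads off the first column, which is the rotated state. The function names and the particular axes and angles are illustrative assumptions.

```python
import math

def rotation(n, theta):
    """exp(-i n.sigma theta) written out entry by entry as in Eq. (3.21)."""
    n1, n2, n3 = n
    c, s = math.cos(theta), math.sin(theta)
    return [[c - 1j * n3 * s, (-1j * n1 - n2) * s],
            [(-1j * n1 + n2) * s, c + 1j * n3 * s]]

def outcome_probabilities(n, theta):
    """Probabilities of finding |0> and |1> after applying the rotation to |0>."""
    u = rotation(n, theta)
    amp0, amp1 = u[0][0], u[1][0]      # the first column of U is U|0>
    return abs(amp0) ** 2, abs(amp1) ** 2

# Illustrative choices of axis and angle
unchanged = outcome_probabilities((0.0, 1.0, 0.0), 0.0)          # stays |0>
flipped = outcome_probabilities((0.0, 1.0, 0.0), math.pi / 2)    # becomes |1>
equal = outcome_probabilities((0.0, 1.0, 0.0), math.pi / 4)      # equal superposition
generic = outcome_probabilities((1 / 3, 2 / 3, 2 / 3), 1.1)      # generic unit axis

print(unchanged, flipped, equal, generic, sum(generic))
```

For any unit vector the two probabilities should sum to one, which is the content of the remark following the two probability formulas.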
Density Matrix for Mixed States
For a system with $N$ dimensions, a mixed state density matrix 
(or density operator, see Appendix \ref{app:cohvec})  is a matrix that is used to
describe a more general state of a quantum system.  This can be written as 
$$ \rho = \sum_i a_i\rho_i, \tag{3.25} $$
where $a_i \geq 0$, $\sum_i a_i = 1$, and the $\rho_i$ are pure states.  There is also a generalization of the Bloch sphere which is described in Appendix {app:polvec}.  
Mixed state  density matrices are important in all descriptions of physical implementations of quantum information processing.  For this reason, a bit of labor should go into understanding the density matrix. The rest of this section is devoted to the physical interpretation and properties of this description of a quantum system.  The first description presented is called the ensemble interpretation of the density matrix.  This is perhaps the easiest to understand.  Another set of physical systems that are described by density matrices will be given elsewhere.
General Properties
In general, a density matrix has the following properties:
$$ \rho = \rho^{\dagger}, \qquad \rho \geq 0, \qquad \mathrm{Tr}(\rho) = 1. \tag{3.26} $$
If, in addition, it is a pure state, then 
$$ \rho^2 = \rho. \tag{3.27} $$
The second property in Eq.(3.26) really means that the eigenvalues of the density matrix are greater than or equal to zero.
Density Matrix for a Mixed State: Two States
A mixed state density matrix for a two-state system is a rank two density matrix, $\rho_m$, which can be described by 
$$ \rho_{m} = \left[a_{1}\rho_{1} + a_{2}\rho_{2}\right], \tag{3.28} $$
where $\rho_1 = \left\vert\psi_1\right\rangle\left\langle\psi_1\right\vert$, $\rho_2 = \left\vert\psi_2\right\rangle\left\langle\psi_2\right\vert$ 
and $a_1, a_2 \geq 0$.  The $a_i$ are probabilities and must sum to one.
(Note, if $\rho_1 = \rho_2$, or if either $a_1$ or
$a_2$ is zero, then this reduces to a pure state.)  In this mixture, 
the probability of finding the state $\left\vert\psi_1\right\rangle$ is $a_1$
and the probability of finding the state $\left\vert\psi_2\right\rangle$ is $a_2$.
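A small numerical sketch can make the rank-two statement concrete. It mixes two pure states with weights $a_1$ and $a_2$ and computes the eigenvalues from the trace and determinant. The particular states ($\left\vert 0\right\rangle$ and an equal superposition), the weights, and the helper names are illustrative assumptions.

```python
import math

def pure_density_matrix(psi):
    """Outer product |psi><psi| for a state given as a list of amplitudes."""
    return [[complex(x) * complex(y).conjugate() for y in psi] for x in psi]

def mix(a1, rho1, a2, rho2):
    """Convex combination a1*rho1 + a2*rho2 of two 2x2 density matrices."""
    return [[a1 * rho1[i][j] + a2 * rho2[i][j] for j in range(2)]
            for i in range(2)]

def eigenvalues_2x2(rho):
    """Eigenvalues of a 2x2 Hermitian matrix from its trace and determinant."""
    t = complex(rho[0][0] + rho[1][1]).real
    d = complex(rho[0][0] * rho[1][1] - rho[0][1] * rho[1][0]).real
    root = math.sqrt(max(t * t - 4 * d, 0.0))
    return (t - root) / 2, (t + root) / 2

s = 1 / math.sqrt(2)
rho1 = pure_density_matrix([1, 0])    # |0><0|
rho2 = pure_density_matrix([s, s])    # equal superposition of |0> and |1>

rho_m = mix(0.25, rho1, 0.75, rho2)   # illustrative weights a1, a2
low, high = eigenvalues_2x2(rho_m)

# Tr(rho^2) equals the sum of |rho_ij|^2 for a Hermitian matrix
purity = sum(abs(rho_m[i][j]) ** 2 for i in range(2) for j in range(2))

# Setting one weight to zero recovers a pure state with eigenvalues 0 and 1
pure_low, pure_high = eigenvalues_2x2(mix(1.0, rho1, 0.0, rho2))

print(low, high, purity, pure_low, pure_high)
```

Both eigenvalues of the mixture are strictly positive and $\mathrm{Tr}(\rho_m^2)$ falls below one, whereas the limiting case with a zero weight returns the pure-state eigenvalues.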
Description of Open Quantum Systems: An Example
One example of the utility of a density matrix is the following
statistical problem.  Let us consider a collection of electrons in a box, where their
spin is a two-state system being either up or down when measured.  If
a subset of these electrons is prepared in the state "up" before
being put in the box and the rest "down," then the description of
the system of particles is given by 
$$ \rho = a_1\left\vert\uparrow\right\rangle\left\langle\uparrow\right\vert + a_2\left\vert\downarrow\right\rangle\left\langle\downarrow\right\vert, \tag{3.29} $$
where the fraction of "up" particles is $a_1$ and the fraction of "down" is $a_2$.  Our system is described by this density matrix: if a particle is chosen at random from the box and measured, the state of the particle is $\left\vert\uparrow\right\rangle$ with probability $a_1$
and $\left\vert\downarrow\right\rangle$ with probability $a_2$.  This is known as the "statistical
interpretation" of the density operator. 
There is another example that is more relevant for our purposes. Let us consider another two-state system, prepared in a pure state $\left\vert\psi\right\rangle$.
If there is some probability $p$ for an error to occur, let us say it is a unitary operator $U$, then the density matrix for the
system is 
$$ \rho = (1-p)\left\vert\psi\right\rangle\left\langle\psi\right\vert + p\,U\left\vert\psi\right\rangle\left\langle\psi\right\vert U^{\dagger}. \tag{3.30} $$
This is the same form as Eq.(3.29).  
Note that in each 
case the probabilities associated with the density matrix, $a_1$ and
$a_2$ (generally, $a_i$), are classical probabilities;
they are associated with a classical probability distribution: the
probability for error/no error and up/down.  These are not
probabilities associated with the superposition of the quantum state
in the equation $\left\vert\psi\right\rangle = \alpha_0\left\vert 0\right\rangle + \alpha_1\left\vert 1\right\rangle$
given by the square of the moduli of the coefficients.  This is an
important distinction!  The state
$\left\vert\psi\right\rangle$ can be taken to the state $\left\vert 0\right\rangle$ with a unitary
transformation.  This state is deterministic in the sense that the
result $\left\vert 0\right\rangle$ will be obtained from a measurement in the
computational basis since there is no probability for obtaining
$\left\vert 1\right\rangle$.  However, for an error probability strictly between zero and one and an
operator $U$ that actually changes the state, the matrix $\rho$ has rank two and thus can never have
probability $1$ for either of the two states, $\left\vert 0\right\rangle$ or $\left\vert 1\right\rangle$.
Thus we have maximum knowledge about a pure state since
there is a way to choose a measurement, perhaps after a unitary
transformation, which achieves a certain result with probability $1$.
For the mixed state density operator this is not possible.  The state 
$$ \rho = \frac{1}{2}\mathbb{I} = \frac{1}{2}\left(\begin{array}{cc}1&0\\0&1\end{array}\right), \tag{3.31} $$
for which we have the least amount of knowledge, is called the
maximally mixed state.   The
state could be either up or down with equal probability and neither is
a better guess.  If the two eigenvalues are not equal, then there is a
better guess (or bet) as to the result of a measurement. If one
eigenvalue is zero, then there is a definite best guess.  
To be more specific, independent of basis (unitary transformations),
one always has a probability greater than zero of measuring
$\left\vert 0\right\rangle$ and probability greater than zero of measuring
$\left\vert 1\right\rangle$. Thus the state described by the density matrix is
a mixed state in the sense
that it can be considered a statistical mixture of the two states
$\left\vert 0\right\rangle$ and $\left\vert 1\right\rangle$.  This, because classical
probabilities are included separately, is significantly different from
the pure state density matrix, which is a special case of all density
matrices.  
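To see that mixtures remain after a unitary transformation on the
system, note that a unitary transformation does not change the eigenvalues.  
This is because the eigenvalue equation is the same for a Hermitian
matrix and its corresponding diagonal matrix.  Let $\rho_D = U\rho U^{\dagger}$.  It can now be seen, 
$$ \det(\rho_D - \lambda\mathbb{I}) = \det(U\rho U^{\dagger} - \lambda UU^{\dagger}) = \det(U)\det(\rho - \lambda\mathbb{I})\det(U^{\dagger}) = \det(\rho - \lambda\mathbb{I}). \tag{3.32} $$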
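The invariance of the eigenvalues under a unitary transformation can be illustrated with the error model described earlier in this section. In the sketch below the error operator is taken to be $\sigma_x$ acting on $\left\vert 0\right\rangle$ with probability $0.1$, and the subsequent unitary is a Hadamard-type matrix; these choices and the helper names are assumptions made purely for illustration.

```python
import math

def matmul(a, b):
    """Multiply two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(a):
    """Conjugate transpose of a square matrix."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]

def eigenvalues_2x2(rho):
    """Eigenvalues of a 2x2 Hermitian matrix from its trace and determinant."""
    t = complex(rho[0][0] + rho[1][1]).real
    d = complex(rho[0][0] * rho[1][1] - rho[0][1] * rho[1][0]).real
    root = math.sqrt(max(t * t - 4 * d, 0.0))
    return (t - root) / 2, (t + root) / 2

p = 0.1                                # assumed error probability
rho0 = [[1, 0], [0, 0]]                # initial pure state |0><0|
sigma_x = [[0, 1], [1, 0]]             # assumed unitary error operator

flipped = matmul(matmul(sigma_x, rho0), dagger(sigma_x))
rho = [[(1 - p) * rho0[i][j] + p * flipped[i][j] for j in range(2)]
       for i in range(2)]

s = 1 / math.sqrt(2)
hadamard = [[s, s], [s, -s]]           # an arbitrary unitary applied afterwards
rotated = matmul(matmul(hadamard, rho), dagger(hadamard))

before = eigenvalues_2x2(rho)
after = eigenvalues_2x2(rotated)
print(before, after)
```

The two eigenvalue pairs should agree, so a state that is mixed before the unitary remains mixed after it.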
Two-State Example: Bloch Sphere
Since our interest is primarily in qubits, which are two-state
systems, we return again to an example.  
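Two-state density matrices have a very convenient representation, the
so-called Bloch sphere representation.  Because the density matrix is
Hermitian, it can be written as 
$$ \rho = \frac{1}{2}(\mathbb{I} + \vec{n}\cdot\vec{\sigma}), \tag{3.33} $$
where, for the density matrix to be positive, $|\vec{n}| \leq 1$, and the
$\sigma_i$ are the Pauli matrices 
$$ \vec{n}\cdot\vec{\sigma} = n_{1}\left(\begin{array}{cc}0&1\\1&0\end{array}\right) + n_{2}\left(\begin{array}{cc}0&-i\\i&0\end{array}\right) + n_{3}\left(\begin{array}{cc}1&0\\0&-1\end{array}\right). \tag{3.34} $$
The matrices on the RHS of this equation are the Pauli matrices discussed above.  It is not difficult to convince yourself that any $2\times 2$ Hermitian matrix can be written as a real linear combination of the three Pauli matrices and the identity.  The eigenvalues of $\rho$ are given by
$$ \lambda_{\pm} = \frac{1}{2}\left(1 \pm |\vec{n}|\right). \tag{3.35} $$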
When $|\vec{n}| = 1$, the state is pure, i.e., the matrix 
has rank one since it has one eigenvalue equal to one and one equal to zero.  If
$|\vec{n}|$ is less than one, the density matrix represents a mixed state since the rank is
greater than one: there are two non-zero eigenvalues.  This leads to
the following picture: the pure states lie on the surface of the
sphere ($|\vec{n}| = 1$), and mixed states lie in the interior of
the sphere with the maximally mixed state at the origin.  This is
supposedly due to Bloch. Hence the name Bloch sphere.  
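To connect the Bloch sphere picture with explicit matrices, the sketch below converts a Bloch vector into a density matrix and back. It relies on the standard relation $n_i = \mathrm{Tr}(\rho\sigma_i)$, which is not stated in these notes and is used here as an assumption; the function names and the sample vector are likewise illustrative.

```python
import math

def density_from_bloch(n):
    """rho = (I + n.sigma)/2, the Bloch sphere representation."""
    n1, n2, n3 = n
    return [[(1 + n3) / 2, (n1 - 1j * n2) / 2],
            [(n1 + 1j * n2) / 2, (1 - n3) / 2]]

def bloch_from_density(rho):
    """Components n_i = Tr(rho sigma_i), read off from the matrix entries."""
    n1 = complex(rho[0][1] + rho[1][0]).real
    n2 = complex(1j * (rho[0][1] - rho[1][0])).real
    n3 = complex(rho[0][0] - rho[1][1]).real
    return n1, n2, n3

n = (0.3, -0.4, 0.5)                       # illustrative vector inside the sphere
rho = density_from_bloch(n)
recovered = bloch_from_density(rho)

length = math.sqrt(sum(x * x for x in n))
lam_minus, lam_plus = (1 - length) / 2, (1 + length) / 2

# The product of the eigenvalues must equal the determinant of rho
determinant = complex(rho[0][0] * rho[1][1] - rho[0][1] * rho[1][0]).real

print(recovered, lam_minus, lam_plus, determinant)
```

For a vector of unit length `lam_minus` vanishes and the state is pure; for shorter vectors both eigenvalues are positive and the state is mixed.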
Using $\rho = \frac{1}{2}(\mathbb{I} + \vec{n}\cdot\vec{\sigma})$, the condition that $\rho^2 = \rho$ for a pure
state can also be determined.  The square in the Bloch sphere
representation yields
$$ \rho^2 = \frac{1}{4}\left[\mathbb{I} + 2\,\vec{n}\cdot\vec{\sigma} + (\vec{n}\cdot\vec{\sigma})^2\right], \tag{3.36} $$
and using 
$$ (\vec{n}\cdot\vec{\sigma})^2 = |\vec{n}|^2\,\mathbb{I}, \tag{3.37} $$
then $\rho^2 = \rho$ if and only if $|\vec{n}| = 1$.  This technique is
used for higher dimensions.  See Appendix E.
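Two density matrices $\rho_1$ and 
$\rho_2$, with Bloch vectors $\vec{n}_1$ and $\vec{n}_2$, correspond to orthogonal 
states when 
$$ \mathrm{Tr}(\rho_1\rho_2) = \frac{1}{2}\left(1 + \vec{n}_1\cdot\vec{n}_2\right) = 0. \tag{3.38} $$
This implies that 
$$ \vec{n}_1\cdot\vec{n}_2 = -1. \tag{3.39} $$
Since the magnitudes are at most one, both must equal one, so the orthogonal states correspond to 
pure states on the surface of the sphere which are represented by 
antipodal points.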
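The antipodal-point statement can be checked directly. The sketch below evaluates $\mathrm{Tr}(\rho_1\rho_2)$ for a pair of antipodal unit Bloch vectors and, for contrast, for a non-antipodal pair. The helper names and the sample vectors are illustrative assumptions.

```python
import math

def density_from_bloch(n):
    """rho = (I + n.sigma)/2, the Bloch sphere representation."""
    n1, n2, n3 = n
    return [[(1 + n3) / 2, (n1 - 1j * n2) / 2],
            [(n1 + 1j * n2) / 2, (1 - n3) / 2]]

def trace_of_product(a, b):
    """Tr(a b) for two 2x2 matrices, returned as a real number."""
    return complex(sum(a[i][j] * b[j][i]
                       for i in range(2) for j in range(2))).real

n = (0.6, 0.0, 0.8)                  # a unit vector: a pure state
antipode = (-0.6, 0.0, -0.8)         # the opposite point on the sphere
north = (0.0, 0.0, 1.0)              # another pure state, not antipodal to n

overlap_antipodal = trace_of_product(density_from_bloch(n),
                                     density_from_bloch(antipode))
overlap_generic = trace_of_product(density_from_bloch(n),
                                   density_from_bloch(north))

print(overlap_antipodal, overlap_generic)
```

The antipodal pair gives zero overlap, while the second pair gives $\frac{1}{2}(1 + 0.8) = 0.9$, consistent with the trace condition for orthogonality.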
Rotations of Bloch Vectors
As shown above, the solution to the Schrodinger equation for the density operator is (see Eq.(3.11))
$$ \rho(t) = U(t)\rho(0)U^{\dagger}(t). $$
In general a closed system will evolve according to 
$$ \rho^{\prime} = U\rho U^{\dagger} $$
whether or not the time dependence is explicitly taken into account.  When the density operator is represented using the Bloch vector, the vector is rotated by the unitary transformation.  This is seen through an explicit calculation.  
There are two ways to see this.  One is to simply act with the matrices in the Euler angle parameterization in Section C.5.1 on each of the Pauli matrices to show that indeed,
$$ U\sigma_i U^{\dagger} = \sum_{j} R_{ij}\sigma_j, \tag{3.40} $$
where the real $3\times 3$ matrix $R$ is easily seen to be a standard rotation matrix.  (See for example http://en.wikipedia.org/wiki/Rotation_matrix.)  
Another way to do this is to take 
$$ \rho = \frac{1}{2}(\mathbb{I} + \vec{m}\cdot\vec{\sigma}) $$
as in Eq.(3.33).  (Recall $|\vec{m}| \leq 1$.)  Now act on $\rho$ with $U$ as given in Section C.5.1 by the so-called adjoint action $U\rho U^{\dagger}$, 
$$ U\rho U^{\dagger} = [\mathbb{I}\cos(\theta/2) - i\vec{n}\cdot\vec{\sigma}\sin(\theta/2)]\frac{1}{2}(\mathbb{I} + \vec{m}\cdot\vec{\sigma})[\mathbb{I}\cos(\theta/2) + i\vec{n}\cdot\vec{\sigma}\sin(\theta/2)]. \tag{3.41} $$
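To do this calculation explicitly, it helps (but is not necessary) to use the following identity,
$$ (\vec{a}\cdot\vec{\sigma})(\vec{b}\cdot\vec{\sigma}) = (\vec{a}\cdot\vec{b})\,\mathbb{I} + i(\vec{a}\times\vec{b})\cdot\vec{\sigma}. \tag{3.42} $$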
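Then, if one only considers the non-trivial part of the density operator, $\vec{m}\cdot\vec{\sigma}$, the result is 
$$ \begin{aligned}e^{-i\vec{n}\cdot\vec{\sigma}\theta/2}\,\vec{m}\cdot\vec{\sigma}\,e^{i\vec{n}\cdot\vec{\sigma}\theta/2} &= \vec{m}\cdot\vec{\sigma}\cos^2(\theta/2) + 2(\vec{n}\times\vec{m})\cdot\vec{\sigma}\sin(\theta/2)\cos(\theta/2)\\ &\;\;\;\;+\left\{(\vec{n}\cdot\vec{m})(\vec{n}\cdot\vec{\sigma}) - [(\vec{n}\times\vec{m})\times\vec{n}]\cdot\vec{\sigma}\right\}\sin^2(\theta/2)\end{aligned} \tag{3.43} $$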
or 
$$ \begin{aligned}e^{-i\vec{n}\cdot\vec{\sigma}\theta/2}\,\vec{m}\cdot\vec{\sigma}\,e^{i\vec{n}\cdot\vec{\sigma}\theta/2} &= \frac{1}{2}\vec{m}\cdot\vec{\sigma}\cos(\theta) - \frac{1}{2}(\vec{n}\cdot\vec{m})(\vec{n}\cdot\vec{\sigma})\cos(\theta) + (\vec{n}\cdot\vec{m})(\vec{n}\cdot\vec{\sigma})\\ &\;\;\;\;+(\vec{n}\times\vec{m})\cdot\vec{\sigma}\sin(\theta) + \frac{1}{2}[(\vec{n}\times\vec{m})\times\vec{n}]\cdot\vec{\sigma}\cos(\theta)\end{aligned} \tag{3.44} $$
where
$$ (\vec{n}\times\vec{m})\times\vec{n} = \vec{m} - (\vec{n}\cdot\vec{m})\,\vec{n}. \tag{3.45} $$
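Therefore, the result of the action of $U$ is to produce, from $\vec{m}$, the vector
$$ \vec{m}^{\prime} = (\vec{n}\cdot\vec{m})\,\vec{n} + \left[\vec{m} - (\vec{n}\cdot\vec{m})\,\vec{n}\right]\cos(\theta) + (\vec{n}\times\vec{m})\sin(\theta). \tag{3.46} $$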
This equation can be interpreted as follows.  We consider three components of the vector, the part along the axis of rotation and the two parts in the plane perpendicular to the axis of rotation.  The part of the vector along the axis of rotation, $(\vec{n}\cdot\vec{m})\,\vec{n}$, does not change.  The parts perpendicular to $\vec{n}$ change just like a vector rotated in a plane, but these parts are rotated in the plane perpendicular to the rotation axis and sitting at the end of the vector $(\vec{n}\cdot\vec{m})\,\vec{n}$.  It takes a bit of geometry and vector algebra to show this is the case.
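The rotation picture can be tested numerically by comparing the Bloch vector of $U\rho U^{\dagger}$ with the standard axis-angle (Rodrigues) rotation of $\vec{m}$ about the unit vector $\vec{n}$ by the angle $\theta$. The sketch below does this. The relation $m_i = \mathrm{Tr}(\rho\sigma_i)$, the function names, and the sample axis, angle, and vector are assumptions for illustration.

```python
import math

def matmul(a, b):
    """Multiply two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(a):
    """Conjugate transpose of a square matrix."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]

def density_from_bloch(m):
    """rho = (I + m.sigma)/2."""
    m1, m2, m3 = m
    return [[(1 + m3) / 2, (m1 - 1j * m2) / 2],
            [(m1 + 1j * m2) / 2, (1 - m3) / 2]]

def bloch_from_density(rho):
    """Components m_i = Tr(rho sigma_i), read off from the matrix entries."""
    return (complex(rho[0][1] + rho[1][0]).real,
            complex(1j * (rho[0][1] - rho[1][0])).real,
            complex(rho[0][0] - rho[1][1]).real)

def rotation_unitary(axis, theta):
    """U = I cos(theta/2) - i (axis.sigma) sin(theta/2)."""
    n1, n2, n3 = axis
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return [[c - 1j * n3 * s, (-1j * n1 - n2) * s],
            [(-1j * n1 + n2) * s, c + 1j * n3 * s]]

def rodrigues(axis, m, theta):
    """Rotate m about the unit vector axis by theta (axis-angle formula)."""
    along = sum(axis[i] * m[i] for i in range(3))
    cross = (axis[1] * m[2] - axis[2] * m[1],
             axis[2] * m[0] - axis[0] * m[2],
             axis[0] * m[1] - axis[1] * m[0])
    return tuple(along * axis[i]
                 + (m[i] - along * axis[i]) * math.cos(theta)
                 + cross[i] * math.sin(theta) for i in range(3))

axis = (1 / 3, 2 / 3, 2 / 3)      # illustrative unit rotation axis
m = (0.2, -0.4, 0.5)              # illustrative Bloch vector
theta = 1.2

U = rotation_unitary(axis, theta)
rotated_rho = matmul(matmul(U, density_from_bloch(m)), dagger(U))
from_matrices = bloch_from_density(rotated_rho)
from_formula = rodrigues(axis, m, theta)
print(from_matrices, from_formula)
```

The two printed vectors should agree to rounding error, and their common component along the axis equals that of the original vector.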
Expectation Values
The expectation value $\langle\mathcal{O}\rangle$ 
of an operator $\mathcal{O}$ is given by 
$$ \langle\mathcal{O}\rangle = \mathrm{Tr}(\rho\,\mathcal{O}), \tag{3.47} $$
and is the "average value" of the operator.  For a pure state 
$\rho = \left\vert\psi\right\rangle\left\langle\psi\right\vert$, this reduces to 
$$ (\langle\mathcal{O}\rangle)_p = \left\langle\psi\right\vert\mathcal{O}\left\vert\psi\right\rangle. \tag{3.48} $$
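As a small illustration of the trace formula, the sketch below evaluates the expectation value of $\sigma_z$ for a pure state, both as $\mathrm{Tr}(\rho\,\mathcal{O})$ and as $\left\langle\psi\right\vert\mathcal{O}\left\vert\psi\right\rangle$, and then for a diagonal mixture. The choice of operator, the amplitudes, the mixing weights, and the helper names are illustrative assumptions.

```python
import math

def pure_density_matrix(psi):
    """Outer product |psi><psi| for a state given as a list of amplitudes."""
    return [[complex(x) * complex(y).conjugate() for y in psi] for x in psi]

def expectation(rho, op):
    """Tr(rho op) for 2x2 matrices, returned as a real number."""
    return complex(sum(rho[i][j] * op[j][i]
                       for i in range(2) for j in range(2))).real

sigma_z = [[1, 0], [0, -1]]

# Pure state: Tr(rho O) should equal <psi|O|psi>
psi = [0.6, 0.8j]
via_trace = expectation(pure_density_matrix(psi), sigma_z)
via_bracket = complex(sum(complex(psi[i]).conjugate() * sigma_z[i][j] * psi[j]
                          for i in range(2) for j in range(2))).real

# Mixed state: 70% |0><0| and 30% |1><1| (illustrative weights)
rho_mixed = [[0.7, 0.0], [0.0, 0.3]]
mixed_value = expectation(rho_mixed, sigma_z)

print(via_trace, via_bracket, mixed_value)
```

For the pure state both routes give $|0.6|^2 - |0.8|^2 = -0.28$, and for the mixture the result is the classically weighted average $0.7 - 0.3 = 0.4$.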