Chapter 4 - Entanglement
Contents
Introduction
Quantum entanglement is the likely the most uniquely quantum mechanical property of quantum systems. It is also believed to be responsible for many of the advantages that quantum information processing has over classical systems. It therefore behooves us to try to understand quantum entanglement and how it is different from what is present in classical systems and also to see how it is used. Some examples of its utility will be discussed in the next chapter. The examples include teleportation and dense coding.
Entangled states puzzled many of the founders of quantum theory and Einstein may be the most famous of these. It prompted the paper by Einstein, Podolsky, and Rosen (EPR)[48] which attempts to clarify exactly what is bothersome about entanglement and how there may be another explanation. It is not surprising that entanglement theory has become a central part of many investigations into quantum theory and especially quantum information and quantum foundations.
There are still many open problems and unanswered questions in this area of research. Some of the most basic and fundamental questions about entangled states are still unanswered. For example, given a mixed-state density matrix for a quantum system, with few exceptions, we still do not know how to tell if the systems being described by that matrix are entangled or not. Also, although there are ways with which to quantify the entanglement in a system of particles, these quantities are notoriously difficult to calculate. Here again, with few exceptions, we do not know how to calculate the amount of entanglement in a system analytically.
What we do understand and what we can explain is the entanglement between bipartite systems that are describable by pure quantum states. In this chapter, the problem of entanglement in quantum mechanics in general is discussed. Applications, extensions, and generalizations will be discussed in later sections.
EPR Paradox
Before diving into entanglement, in the context of quantum computing and information, it would be prudent to discuss the now-famous EPR paradox. It was first proposed in a paper by Einstein, Podolsky, and Rosen in 1935. The original paper includes a fairly simple mathematical explanation of the paradox---it is, however, not really necessary as the thought experiment is quite easily understood conceptually with (mostly) words. A slightly simplified version of the experiment will be given here.
Suppose a neutral pi meson, which has no spin, is at rest. It then decays into an electron and a positron, necessarily going in opposite directions. The wave function can now be written as
| (4.1) | 
As can be seen, we now have a system of two particles that have a correlated spin---one being up (the first, the electron, say) and the other being down (the positron, say)---with an equal probability for each configuration being the outcome of a measurement. The system is said to be entangled, as a measurement to find the value of the spin on one will guarantee that the other particle has the opposite spin. In other words, it cannot be written as because we can choose to measure either spin, in any direction, and the other must have the opposite spin to what is measured for the first particle.
Now what is the significance here? The orthodox position says that the wave function is the complete representation of the system. When the measurement occurs, the state of the system of the two particles is projected into one of two spin states, changing the system from a superposition to one of the two terms. (This is the "Born rule", or measurement postulate.) Each outcome will occur with equal probability.
But, in the context of EPR, how can this be? Imagine the entangled electron-positron pair are at opposite ends of the galaxy when one of them is measured. The conservation of angular momentum says that the other particle all of the way on the other side of the galaxy must instantly be the opposite spin as the measured particle. EPR argued that this is a violation of locality, which says an effect cannot travel faster than the speed of light. If the very action of measurement on one particle is what caused the other particle to realize the opposite spin, then locality has been violated. Therefore, the measurement could not have caused the projection of the state.
EPR concluded that this proves that quantum mechanics is incomplete---that the wave function is missing some information. There was no "spooky-action-at-a-distance," there must be some underlying property that is absent from the wave function. Einstein rejected the notion that a measurement caused this quasi-mystical collapse of the wave function (as it is sometimes called)---the particles do not care if they are being watched or not.
Bell's Theorem
The following discussion is primarily due to David Griffiths [4].
The peculiarities of the EPR paradox were convincing enough to drive many to examine possible "hidden variable theories." The basic idea is that there exists a quantity, often denoted by , that must be included in the wave function to completely describe the system. This is some variable, or collection of variables, that the state of the systems depend on in order to explain the observed behavior. J.S. Bell very elegantly showed in 1964 that this is not the case, using the thought experiment (although slightly modified) that EPR proposed.
Suppose we have another pion at rest about to decay with detectors oriented equidistant and on opposite sides, ready to measure the spin of the electron and positron. Further suppose that, unlike the previous scenario, these detectors can be rotated in order to detect the spin in the direction of unit vectors and for the electron and positron respectively.
When the electron and positron pair strikes the detectors, a spin up () or spin down () is registered. The product of the results is then examined. If they are oriented parallel, where , then the result will be -1. If anti-parallel, the result is then +1. The averages are, obviously,
| (4.2) | 
Quantum mechanics tells us that for arbitrary vectors,
| (4.3) | 
The hidden variable(s), can now be introduced. This can represent any number of variables that complete the description of the system and allow for locality. Then define some functions, and that will give the results for the measurement (either +1 or -1) for the electron and positron respectively.
The locality assumption tells us that the orientation of one detector will not affect the outcome of the measurement of the other detector; one can imagine a scenario where the distances and/or orientations of the detectors are chosen at a time too late for any information to be transferred slower than light. It must also be true that, when the detectors are parallel, the results must be
| (4.4) | 
due to the conservation of angular momentum. Now define a probability density, for the hidden variable. Since we know nothing of , this can be anything as long as it is non-negative and normalizable (). We can now look at the product of the measurements,
| (4.5) | 
From Eq.(4.4) this can be rewritten as:
| (4.6) | 
Now for the clever part. Introducing another unit vector, , and noting that
| (4.7) | 
Recognizing some inequalities,
| (4.8) | 
the following remarkable result is obtained,
| (4.9) | 
The last form is known as Bell's inequality. This inequality is true for any local hidden variable theory.
What does this mean? Let us define and to be orthogonal and to make a angle with both of them. Using quantum mechanics (Equation(4.3)),
Inserting the values into Bell's inequality (Equation (4.9)),
Since the inequality is violated!
This means that quantum mechanics is incompatible with any local hidden variable theory. The EPR paradox had stronger implications than the authors realized; if local realism is held, then quantum mechanics is incorrect. This has been repeatedly disproved experimentally. Thus no local hidden variable theory can resolve the "spooky-action-at-a-distance."
Entangled Pure States
It is now possible for us to think about entangled states as those that are more correlated than any classical state. Bell's inequality is one way to identify such states and it can be used to find more entangled states.
Let us consider two quantum systems, one called A and the other B. Let us suppose the joint state of the entire system, comprised of A and B, is a pure state. If the subsystems are independent and have never interacted, then the state of the composite system of the two particles can be written as
| (4.10) | 
where is the state of particle A and is the state of subsystem B. This tensor product structure is sometimes stated as a postulate of quantum mechanics as in Nielsen and Chuang's book [2]. In this case the two particles are not correlated in any way---they are said to be unentangled or separable. When a pure state cannot be written in this form it is said to be entangled.
For example, the most general form for a pure state of two qubits is given by Eq. (2.30).
Examples are given below of states that are entangled and thus cannot be written in the form of Eq. (4.10).
For two particles (or systems) to become entangled, they must first interact with each other. This entanglement cannot increase (usually) by acting on an individual subsystem or even both subsystems separately.  Only joint measurements on both or interactions between the two can increase entanglement.  Actions on an individual particle, without involving the other, are called local actions or local operations.   For example, local unitary operations on individual particles can be written as
| (4.11) | 
so that
| (4.12) | 
Local unitary transformations will not change the entanglement of a system. Furthermore, local measurements or local measurements combined with unitary transformations cannot, on average, increase the entanglement between subsystems.
For later use, it is relevant to note that the density matrix for the composite system in Eq.(4.10) is 
| (4.13) | 
where and ---the density operator of a product state is the product of density operators.
Bell States
The simplest examples of entangled states are the entangled states of two two-state systems. There are four different versions of what is known as the "maximally entangled state" of two qubits. The "maximally" will be explained below. These four different versions are called Bell states and are
| (4.14) | 
This is an orthonormal set of states that are all able to be obtained from each other by acting on one particle alone or both individual particles with unitary transformations (i.e. acting with local unitary transformations). For example, consider the local unitary transformation acting on . The result is . Similarly, acting on yields , and so on.
These states certainly cannot be written in the form
What if they could? Let and . Notice that the general form is
| (4.15) | 
so the coefficient of times the coefficient of minus the coefficient of times the coefficient of is zero. This is not true for any of the Bell states---thus, they cannot be written as a tensor product of two 1-particle states. So, for any 2-particle state,
| (4.16) | 
the state is separable or unentangled if (and only if!) . Otherwise, it is entangled.
Entangled Mixed States
The state in Eq.(4.1) is not entangled, so it is called separable. More precisely it is referred to as a simply separable state. In general, a state is separable if its density matrix can be written in the form
| (4.17) | 
where is a valid density matrix for subsystem , and . An entangled state is one that cannot be written in the form of Eq.(4.17).
For a pure state, the situation is simpler. A pure state is entangled if and only if it cannot be written in the form of Eq.(4.10). In other words a pure state is entangled if it cannot be written as the product of two states of the individual subsystems.
Reduced Density Operators and the Partial Trace
The Bell states are maximally entangled states. To understand this, one may consider the fact that these states are pure states but information about the individual particles in the system is lacking. In this section, a more precise meaning of this statement is given.
Let us first consider a useful tool, the partial trace. The partial trace is the trace over one of the subsystems (particle states) of a composite system. Let us suppose that the density matrix for a composite system is given by
| (4.18) | 
The partial trace is the trace over one of the subsystems. For example, the trace over subsystem is
| (4.19) | 
since and the trace of a density matrix is one. The matrix is called the reduced density operator, or reduced density matrix. However, this is a special case. The density matrix for a composite system of two (or more) subsystems cannot be written in this form except in very special circumstances---when the two subsystems have never interacted and there are no correlations between them.
For the cases where the two subsystems are entangled, there are at least two ways to calculate the partial trace. One is to write the matrix form of the state in terms of a sum of the tensor products of Pauli matrices. (See Appendix E - Density Operator: Extensions, Sec. Two-State Example: Bloch Sphere.) The other is to realize that the trace can be calculated by summing the projections onto the diagonal elements of the subsystem over which you are tracing. For example, for a general density matrix , the trace is
| (4.20) | 
For the general case, let us consider a density matrix for a bipartite system, . Let the subsystem have Greek letters as indices and let the subsystem have Latin indices. Then
| (4.21) | 
To calculate the reduced density matrix of subsystem by tracing over subsystem , the trace over is taken by computing
| (4.22) | 
For the partial trace of a  density matrix  over the
subsystem , 
| (4.23) | 
which leaves the part of the matrix corresponding to alone and projects the part onto the two diagonal elements and then adds those. Now let us calculate the partial trace of a Bell state, for example . Assuming the first state is the state and the second , we see
| (4.24) | 
which can be rewritten simply as
| (4.25) | 
This is quite an interesting and significant find. The density matrix for the whole system of two qubits is in a pure state indicating maximal knowledge. However, the reduced density matrix, representing our knowledge of one of the individual particles, is completely (or maximally) mixed, indicating minimal knowledge. This means that the two particles or subsystems taken together are in a definite, pure state; yet when taken separately, they contain as little information as possible. This fact indicates entanglement.
It is important to note that for a pure state, the trace over subsystem produces the same result as the trace over subsystem . In other words, for a pure state ,
| (4.26) | 
The Partial Transpose
Quantifying Entanglement
The previous discussion showed that there is a definite notion of maximal entanglement for pure states. From the determinant condition, there is a method for identifying unentangled pure states. A question may arise: how entangled is it if it is not separable nor maximally entangled? There are now many ways of defining measures of entanglement which will be explored in a later section and an appendix. Here, a common way of measuring the entanglement for pure states is given, which is based on the partial trace of a pure state.
Let us consider the extreme cases. First, we found that if the states are Bell states, then the partial trace of the bipartite system of two qubits will yield a density operator for the subsystem that is maximally entangled. Due to the fact that the trace of the density operator is always one, it should be clear that the partial trace of a separable pure state gives a pure state density operator. Notice that the purity of the partial density operator provides a candidate for a measure of entanglement. Before we discuss this further, let us note the following important result, called the Schmidt decomposition.
Schmidt Decomposition
For any pure state density operator of a bipartite system, say with constituents A and B, it can be taken as
| (4.27) | 
This can also be put in the form
| (4.28) | 
Proof: To show that the state can be written in this way, choose unitary matrices and and write:
| (4.29) | 
Since and are unitary, and similarly for ; so this is the same as the original state. This can be rewritten again
| (4.30) | 
Now the result:
| (4.31) | 
This last expression follows from the definitions , and . End Proof
In this case it is clear that the reduced density operator is the same for each subsystem and thus the purity of the reduced density operator can be used as a measure of entanglement.
Extensions and Open Problems
Continue to Chapter 5 - Quantum Information: Basic Principles and Simple Examples

