---
title: "Bell's theorem"
sort_title: "Bell's theorem"
date: 2021-03-28
categories:
- Physics
- Quantum mechanics
- Quantum information
layout: "concept"
---

**Bell's theorem** states that the laws of quantum mechanics
cannot be explained by theories built on
so-called **local hidden variables** (LHVs).

Suppose that we have two spin-1/2 particles, called $$A$$ and $$B$$,
in an entangled [Bell state](/know/concept/bell-state/):

$$\begin{aligned}
    \Ket{\Psi^{-}}
    = \frac{1}{\sqrt{2}} \Big( \Ket{\uparrow \downarrow} - \Ket{\downarrow \uparrow} \Big)
\end{aligned}$$

Since they are entangled,
if we measure the $$z$$-spin of particle $$A$$, and find e.g. $$\Ket{\uparrow}$$,
then particle $$B$$ immediately takes the opposite state $$\Ket{\downarrow}$$.
The point is that this collapse is instant,
regardless of the distance between $$A$$ and $$B$$.

Einstein called this effect "action-at-a-distance",
and used it as evidence that quantum mechanics is an incomplete theory.
He said that there must be some **hidden variable** $$\lambda$$
that determines the outcome of measurements of $$A$$ and $$B$$
from the moment the entangled pair is created.
However, according to Bell's theorem, he was wrong.

To prove this, let us assume that Einstein was right, and some $$\lambda$$,
which we cannot understand, let alone calculate or measure, controls the results.
We want to know the spins of the entangled pair
along arbitrary directions $$\vec{a}$$ and $$\vec{b}$$,
so the outcomes for particles $$A$$ and $$B$$ are:

$$\begin{aligned}
    A(\vec{a}, \lambda) = \pm 1
    \qquad \quad
    B(\vec{b}, \lambda) = \pm 1
\end{aligned}$$

Where $$\pm 1$$ are the eigenvalues of the Pauli matrices
in the chosen directions $$\vec{a}$$ and $$\vec{b}$$:

$$\begin{aligned}
    \hat{\sigma}_a
    &= \vec{a} \cdot \vec{\sigma}
    = a_x \hat{\sigma}_x + a_y \hat{\sigma}_y + a_z \hat{\sigma}_z
    \\
    \hat{\sigma}_b
    &= \vec{b} \cdot \vec{\sigma}
    = b_x \hat{\sigma}_x + b_y \hat{\sigma}_y + b_z \hat{\sigma}_z
\end{aligned}$$

Whether $$\lambda$$ is a scalar or a vector does not matter;
we simply demand that it follows an unknown probability distribution $$\rho(\lambda)$$:

$$\begin{aligned}
    \int \rho(\lambda) \dd{\lambda} = 1
    \qquad \quad
    \rho(\lambda) \ge 0
\end{aligned}$$

The product of the outcomes of $$A$$ and $$B$$ then has the following expectation value.
Note that we only multiply $$A$$ and $$B$$ for shared $$\lambda$$-values:
this is what makes it a **local** hidden variable:

$$\begin{aligned}
    \Expval{A_a B_b}
    = \int \rho(\lambda) \: A(\vec{a}, \lambda) \: B(\vec{b}, \lambda) \dd{\lambda}
\end{aligned}$$

From this, two inequalities can be derived,
which both prove Bell's theorem.


## Bell inequality

If $$\vec{a} = \vec{b}$$, then we know that $$A$$ and $$B$$ always have opposite spins:

$$\begin{aligned}
    A(\vec{a}, \lambda)
    = A(\vec{b}, \lambda)
    = - B(\vec{b}, \lambda)
\end{aligned}$$

The expectation value of the product can therefore be rewritten as follows:

$$\begin{aligned}
    \Expval{A_a B_b}
    = - \int \rho(\lambda) \: A(\vec{a}, \lambda) \: A(\vec{b}, \lambda) \dd{\lambda}
\end{aligned}$$

Next, we introduce an arbitrary third direction $$\vec{c}$$,
and use the fact that $$( A(\vec{b}, \lambda) )^2 = 1$$:

$$\begin{aligned}
    \Expval{A_a B_b} - \Expval{A_a B_c}
    &= - \int \rho(\lambda) \Big( A(\vec{a}, \lambda) \: A(\vec{b}, \lambda) - A(\vec{a}, \lambda) \: A(\vec{c}, \lambda) \Big) \dd{\lambda}
    \\
    &= - \int \rho(\lambda) \Big( 1 - A(\vec{b}, \lambda) \: A(\vec{c}, \lambda) \Big) A(\vec{a}, \lambda) \: A(\vec{b}, \lambda) \dd{\lambda}
\end{aligned}$$

Inside the integral, the only factors that can be negative
are the last two, and their product is $$\pm 1$$.
Taking the absolute value of the whole left,
and of the integrand on the right, we thus get:

$$\begin{aligned}
    \Big| \Expval{A_a B_b} - \Expval{A_a B_c} \Big|
    &\le \int \rho(\lambda) \Big( 1 - A(\vec{b}, \lambda) \: A(\vec{c}, \lambda) \Big)
    \: \Big| A(\vec{a}, \lambda) \: A(\vec{b}, \lambda) \Big| \dd{\lambda}
    \\
    &\le \int \rho(\lambda) \dd{\lambda} - \int \rho(\lambda) A(\vec{b}, \lambda) \: A(\vec{c}, \lambda) \dd{\lambda}
\end{aligned}$$

Since $$\rho(\lambda)$$ is a normalized probability density function,
we arrive at the **Bell inequality**:

$$\begin{aligned}
    \boxed{
        \Big| \Expval{A_a B_b} - \Expval{A_a B_c} \Big|
        \le 1 + \Expval{A_b B_c}
    }
\end{aligned}$$

Any theory involving an LHV $$\lambda$$ must obey this inequality.
The problem, however, is that quantum mechanics dictates the expectation values
for the state $$\Ket{\Psi^{-}}$$:

$$\begin{aligned}
    \Expval{A_a B_b} = - \vec{a} \cdot \vec{b}
\end{aligned}$$

Finding directions which violate the Bell inequality is easy:
for example, if $$\vec{a}$$ and $$\vec{b}$$ are orthogonal,
and $$\vec{c}$$ is at a $$\pi/4$$ angle to both of them,
then the left becomes $$0.707$$ and the right $$0.293$$,
which clearly disagrees with the inequality,
meaning that LHVs are impossible.


## CHSH inequality

The **Clauser-Horne-Shimony-Holt** or simply **CHSH inequality**
takes a slightly different approach, and is more useful in practice.

Consider four spin directions, two for $$A$$ called $$\vec{a}_1$$ and $$\vec{a}_2$$,
and two for $$B$$ called $$\vec{b}_1$$ and $$\vec{b}_2$$.
Let us introduce the following abbreviations:

$$\begin{aligned}
    A_1 &= A(\vec{a}_1, \lambda)
    \qquad \quad
    A_2 = A(\vec{a}_2, \lambda)
    \\
    B_1 &= B(\vec{b}_1, \lambda)
    \qquad \quad
    B_2 = B(\vec{b}_2, \lambda)
\end{aligned}$$

From the definition of the expectation value,
we know that the difference is given by:

$$\begin{aligned}
    \Expval{A_1 B_1} - \Expval{A_1 B_2}
    = \int \rho(\lambda) \Big( A_1 B_1 - A_1 B_2 \Big) \dd{\lambda}
\end{aligned}$$

We introduce some new terms and rearrange the resulting expression:

$$\begin{aligned}
    \Expval{A_1 B_1} - \Expval{A_1 B_2}
    &= \int \rho(\lambda) \Big( A_1 B_1 - A_1 B_2 \pm A_1 B_1 A_2 B_2 \mp A_1 B_1 A_2 B_2 \Big) \dd{\lambda}
    \\
    &= \int \rho(\lambda) A_1 B_1 \Big( 1 \pm A_2 B_2 \Big) \dd{\lambda}
    - \!\int \rho(\lambda) A_1 B_2 \Big( 1 \pm A_2 B_1 \Big) \dd{\lambda}
\end{aligned}$$

Taking the absolute value of both sides
and invoking the triangle inequality then yields:

$$\begin{aligned}
    \Big| \Expval{A_1 B_1} - \Expval{A_1 B_2} \Big|
    &= \bigg|\! \int \rho(\lambda) A_1 B_1 \Big( 1 \pm A_2 B_2 \Big) \dd{\lambda}
    - \!\int \rho(\lambda) A_1 B_2 \Big( 1 \pm A_2 B_1 \Big) \dd{\lambda} \!\bigg|
    \\
    &\le \bigg|\! \int \rho(\lambda) A_1 B_1 \Big( 1 \pm A_2 B_2 \Big) \dd{\lambda} \!\bigg|
    + \bigg|\! \int \rho(\lambda) A_1 B_2 \Big( 1 \pm A_2 B_1 \Big) \dd{\lambda} \!\bigg|
\end{aligned}$$

Using the fact that the product of $$A$$ and $$B$$ is always either $$-1$$ or $$+1$$,
we can reduce this to:

$$\begin{aligned}
    \Big| \Expval{A_1 B_1} - \Expval{A_1 B_2} \Big|
    &\le \int \rho(\lambda) \Big| A_1 B_1 \Big| \Big( 1 \pm A_2 B_2 \Big) \dd{\lambda}
    + \!\int \rho(\lambda) \Big| A_1 B_2 \Big| \Big( 1 \pm A_2 B_1 \Big) \dd{\lambda}
    \\
    &\le \int \rho(\lambda) \Big( 1 \pm A_2 B_2 \Big) \dd{\lambda}
    + \!\int \rho(\lambda) \Big( 1 \pm A_2 B_1 \Big) \dd{\lambda}
\end{aligned}$$

Evaluating these integrals gives us the following inequality,
which holds for both choices of $$\pm$$:

$$\begin{aligned}
    \Big| \Expval{A_1 B_1} - \Expval{A_1 B_2} \Big|
    &\le 2 \pm \Expval{A_2 B_2} \pm \Expval{A_2 B_1}
\end{aligned}$$

We should choose the signs such that the right-hand side is as small as possible, that is:

$$\begin{aligned}
    \Big| \Expval{A_1 B_1} - \Expval{A_1 B_2} \Big|
    &\le 2 \pm \Big( \Expval{A_2 B_2} + \Expval{A_2 B_1} \Big)
    \\
    &\le 2 - \Big| \Expval{A_2 B_2} + \Expval{A_2 B_1} \Big|
\end{aligned}$$

Rearranging this and once again using the triangle inequality,
we get the CHSH inequality:

$$\begin{aligned}
    2
    &\ge \Big| \Expval{A_1 B_1} - \Expval{A_1 B_2} \Big| + \Big| \Expval{A_2 B_2} + \Expval{A_2 B_1} \Big|
    \\
    &\ge \Big| \Expval{A_1 B_1} - \Expval{A_1 B_2} + \Expval{A_2 B_2} + \Expval{A_2 B_1} \Big|
\end{aligned}$$

The quantity on the right-hand side is sometimes called the **CHSH quantity** $$S$$,
and measures the correlation between the spins of $$A$$ and $$B$$:

$$\begin{aligned}
    \boxed{
        S \equiv \Expval{A_2 B_1} + \Expval{A_2 B_2} + \Expval{A_1 B_1} - \Expval{A_1 B_2}
    }
\end{aligned}$$

The CHSH inequality places an upper bound on the magnitude of $$S$$
for LHV-based theories:

$$\begin{aligned}
    \boxed{
        |S| \le 2
    }
\end{aligned}$$


## Tsirelson's bound

Quantum physics can violate the CHSH inequality, but by how much?
Consider the following two-particle operator,
whose expectation value is the CHSH quantity, i.e. $$S = \expval{\hat{S}}$$:

$$\begin{aligned}
    \hat{S}
    = \hat{A}_2 \otimes \hat{B}_1 + \hat{A}_2 \otimes \hat{B}_2 + \hat{A}_1 \otimes \hat{B}_1 - \hat{A}_1 \otimes \hat{B}_2
\end{aligned}$$

Where $$\otimes$$ is the tensor product,
and e.g. $$\hat{A}_1$$ is the Pauli matrix for the $$\vec{a}_1$$-direction.
The square of this operator is then given by:

$$\begin{aligned}
    \hat{S}^2
    = \quad &\hat{A}_2^2 \otimes \hat{B}_1^2 + \hat{A}_2^2 \otimes \hat{B}_1 \hat{B}_2
    + \hat{A}_2 \hat{A}_1 \otimes \hat{B}_1^2 - \hat{A}_2 \hat{A}_1 \otimes \hat{B}_1 \hat{B}_2
    \\
    + &\hat{A}_2^2 \otimes \hat{B}_2 \hat{B}_1 + \hat{A}_2^2 \otimes \hat{B}_2^2
    + \hat{A}_2 \hat{A}_1 \otimes \hat{B}_2 \hat{B}_1 - \hat{A}_2 \hat{A}_1 \otimes \hat{B}_2^2
    \\
    + &\hat{A}_1 \hat{A}_2 \otimes \hat{B}_1^2 + \hat{A}_1 \hat{A}_2 \otimes \hat{B}_1 \hat{B}_2
    + \hat{A}_1^2 \otimes \hat{B}_1^2 - \hat{A}_1^2 \otimes \hat{B}_1 \hat{B}_2
    \\
    - &\hat{A}_1 \hat{A}_2 \otimes \hat{B}_2 \hat{B}_1 - \hat{A}_1 \hat{A}_2 \otimes \hat{B}_2^2
    - \hat{A}_1^2 \otimes \hat{B}_2 \hat{B}_1 + \hat{A}_1^2 \otimes \hat{B}_2^2
    \\
    = \quad &\hat{A}_2^2 \otimes \hat{B}_1^2 + \hat{A}_2^2 \otimes \hat{B}_2^2 + \hat{A}_1^2 \otimes \hat{B}_1^2 + \hat{A}_1^2 \otimes \hat{B}_2^2
    \\
    + &\hat{A}_2^2 \otimes \acomm{\hat{B}_1}{\hat{B}_2} - \hat{A}_1^2 \otimes \acomm{\hat{B}_1}{\hat{B}_2}
    + \acomm{\hat{A}_1}{\hat{A}_2} \otimes \hat{B}_1^2 - \acomm{\hat{A}_1}{\hat{A}_2} \otimes \hat{B}_2^2
    \\
    + &\hat{A}_1 \hat{A}_2 \otimes \comm{\hat{B}_1}{\hat{B}_2} - \hat{A}_2 \hat{A}_1 \otimes \comm{\hat{B}_1}{\hat{B}_2}
\end{aligned}$$

Spin operators are unitary, so their square is the identity,
e.g. $$\hat{A}_1^2 = \hat{I}$$. Therefore $$\hat{S}^2$$ reduces to:

$$\begin{aligned}
    \hat{S}^2
    &= 4 \: (\hat{I} \otimes \hat{I}) + \comm{\hat{A}_1}{\hat{A}_2} \otimes \comm{\hat{B}_1}{\hat{B}_2}
\end{aligned}$$

The *norm* $$\norm{\hat{S}^2}$$ of this operator
is the largest possible expectation value $$\expval{\hat{S}^2}$$,
which is the same as its largest eigenvalue.
It is given by:

$$\begin{aligned}
    \Norm{\hat{S}^2}
    &= 4 + \Norm{\comm{\hat{A}_1}{\hat{A}_2} \otimes \comm{\hat{B}_1}{\hat{B}_2}}
    \\
    &\le 4 + \Norm{\comm{\hat{A}_1}{\hat{A}_2}} \Norm{\comm{\hat{B}_1}{\hat{B}_2}}
\end{aligned}$$

We find a bound for the norm of the commutators by using the triangle inequality, such that:

$$\begin{aligned}
    \Norm{\comm{\hat{A}_1}{\hat{A}_2}}
    = \Norm{\hat{A}_1 \hat{A}_2 - \hat{A}_2 \hat{A}_1}
    \le \Norm{\hat{A}_1 \hat{A}_2} + \Norm{\hat{A}_2 \hat{A}_1}
    \le 2 \Norm{\hat{A}_1 \hat{A}_2}
    \le 2
\end{aligned}$$

And $$\norm{\comm{\hat{B}_1}{\hat{B}_2}} \le 2$$ for the same reason.
The norm is the largest eigenvalue, therefore:

$$\begin{aligned}
    \Norm{\hat{S}^2}
    \le 4 + 2 \cdot 2
    = 8
    \quad \implies \quad
    \Norm{\hat{S}}
    \le \sqrt{8}
    = 2 \sqrt{2}
\end{aligned}$$

We thus arrive at **Tsirelson's bound**,
which states that quantum mechanics can violate
the CHSH inequality by a factor of $$\sqrt{2}$$:

$$\begin{aligned}
    \boxed{
        |S|
        \le 2 \sqrt{2}
    }
\end{aligned}$$

Importantly, this is a *tight* bound,
meaning that there exist certain spin measurement directions
for which Tsirelson's bound becomes an equality, for example:

$$\begin{aligned}
    \hat{A}_1 = \hat{\sigma}_z
    \qquad
    \hat{A}_2 = \hat{\sigma}_x
    \qquad
    \hat{B}_1 = \frac{\hat{\sigma}_z + \hat{\sigma}_x}{\sqrt{2}}
    \qquad
    \hat{B}_2 = \frac{\hat{\sigma}_z - \hat{\sigma}_x}{\sqrt{2}}
\end{aligned}$$

Using the fact that $$\Expval{A_a B_b} = - \vec{a} \cdot \vec{b}$$,
it can then be shown that $$S = 2 \sqrt{2}$$ in this case.


## References
1.  D.J. Griffiths, D.F. Schroeter,
    *Introduction to quantum mechanics*, 3rd edition,
    Cambridge.
2.  J.B. Brask,
    *Quantum information: lecture notes*,
    2021, unpublished.