71. Motion of a Spin One-Half in a Slowly-Varying Magnetic Field:
The Key to Measuring the Muon Moment (g $-$ 2)

Jackson 11.11, plus discussion of g-2.

Introduction

It turns out that when an electron or a muon moves through a magnetic field, the spin direction closely follows the velocity direction. The small difference in rotation rate, of order one part in a thousand, can be measured extremely accurately $—$ so well that contributions from strong interactions (as explained below) can be detected and measured. That is to say, this experiment, with no hadrons in sight, can be used to test quantum chromodynamics calculations, and thereby improve our (inadequate) understanding of strong interactions. The experiment is judged sufficiently important that the recent (2022) annual dollar cost runs to nine figures.

Nonrelativistic Limit: the g-factor

The plan is to track how the direction of the spin vector changes as an electron (or muon) moves along a given path through a slowly varying, time independent magnetic field. (The spin, of course, has an accompanying magnetic moment.)

First, the unit for measuring particle magnetic moments is the Bohr magneton, the magnetic moment of the current loop corresponding to an electron in the lowest Bohr model orbit, the $l = 1$ orbit having angular momentum $ℏ .$ (We’re just explaining the origin of the term $—$ we know now of course that Bohr’s model was flawed, even though it correctly predicted energy differences: the lowest actual hydrogen atom state has $l = 0.$ ) For this simple “classical” Bohr picture, thinking of the electron as a particle of mass $m$ and charge $e$ moving in a circle of radius $r$ at speed $v,$ the angular momentum is $m v r = ℏ,$ and the electric current is $e v / 2 π r$ so this “current loop” has magnetic moment current x area (see lecture 31)

$μ_{B} = I A = e v / (2 π r) π r^{2} = \frac{1}{2} e v r = e ℏ / 2 m .$

Experimentally, the electron has an intrinsic angular momentum (spin) $s_{z} = \pm \frac{1}{2} ℏ,$ and accompanying magnetic moment (SI, gaussian has a c in the denominator)

$\vec{μ} = (g e / 2 m) \vec{s}, g ≅ 2.002...$

Note that if the electron is modeled as a classical spinning charge distribution, that would give $g = 1,$ just as for the Bohr orbit above. On the other hand, Dirac’s linear, relativistically-invariant, equation predicts $g = 2,$ very close to that observed, an impressive feat. The further 0.002… correction is mainly a quantum electrodynamic effect, and theoretical predictions based on that theory give close agreement with experiment, as discussed in detail below.

Electron Spin Precession, Classical and Quantum

We’ll consider first an electron at the origin in a constant magnetic field $\vec{B} = B \hat{\vec{z}} .$ It has a magnetic energy $U = - \vec{μ} \cdot \vec{B},$ and the field exerts a couple $\vec{μ} \times \vec{B}$ on the particle, causing precession around the magnetic field direction:

$d \vec{s} / d t = \vec{μ} \times \vec{B} = (g e / 2 m) \vec{s} \times \vec{B},$

with a frequency $ω = g e B / 2 m .$

Exercise: Check this by drawing $d \vec{s}$ on the diagram.

To reassure ourselves that this classical result is also good for the electron quantum spin one-half (following Landau) we write the Hamiltonian for the spin one-half magnetic field interaction in terms of the Pauli matrices,

$\hat{H} = {\hat{H}}_{0} - μ \vec{σ} \cdot \vec{B}, \vec{s} = \frac{1}{2} ℏ \vec{σ},$

with ${\hat{H}}_{0}$ the spin-independent terms, $μ = g e ℏ / 4 m_{e}$ (SI units), and $\vec{B}$ the magnetic field at the particle position.

We’ll for the present take $g = 2,$ and address the (important!) difference later.

The operator equation of motion is, writing spin $\vec{s} = \frac{1}{2} ℏ \vec{σ},$ (Landau, p 152)

$\dot{\vec{s}} = \frac{i}{2 ℏ} (\hat{H} \vec{σ} - \vec{σ} \hat{H}),$

from which

$\begin{matrix} {\dot{s}}_{i} = - \frac{i μ}{2 ℏ} B_{k} (σ_{k} σ_{i} - σ_{i} σ_{k}) \\ = - \frac{μ}{ℏ} ε_{i k l} B_{k} σ_{l} \\ = \frac{2 μ}{ℏ} {(\vec{s} \times \vec{B})}_{i} = \frac{g e}{2 m} {(\vec{s} \times \vec{B})}_{i} \end{matrix}$

This is in fact identical to the classical equation!

Bottom line:

The spin precesses about the field direction with angular velocity $- g e B / 2 m .$

This is Jackson 11.155, in SI units, although his is for the rest frame (and his $t^{'}$ is time in that frame, in preparation for going to the relativistic case).

For an electron moving through a slowly varying magnetic field, this will give the current spin precession rate.

But the velocity vector is also “precessing” (remember its magnitude is constant if there is no electric field), and (nonrelativistically), it satisfies

$m \frac{d \vec{v}}{d t} = e \vec{v} \times \vec{B} .$

That is, the velocity vector rotates about $\vec{B}$ at angular velocity $- e B / m .$

Comparing the two: IF $g = 2, μ = e ℏ / 2 m,$

the spin direction and the velocity direction precess at the same rate

meaning the spin polarization vector is at a constant angle to the direction of motion.

But $g \neq 2,$ and spin observation is the key to finding the small quantity $g - 2 :$ the angle between spin direction and velocity direction will slowly change, and that rate of change can be measured experimentally.

Relativistic Generalization

Remarkably, it turns out that this constant angle between spin and momentum (for motion in a magnetic field) is still valid relativistically, provided $μ = e ℏ / 2 m,$ meaning $g = 2.$

To prove this, though, we need to write the spin equation of motion in a relativistically invariant way. Clearly we must go to proper time $τ,$ and we must promote the spin $\vec{s}$ to a four-vector, by constructing the four-vector that reduces to the spin three-vector in the particle rest frame.

(Notation warning: We’ll follow Jackson’s notation, but it needs care $—$ it’s font case dependent, lower case is used for spin in the rest frame, upper case for lab frame.

Landau avoids this by writing the four-spin as $a^{α},$ ( $a$ for axial, I think, because, like angular momentum, it’s even under reflection $\vec{x} \to - \vec{x} .$ ) However, in doing this same problem Garg uses $a^{α}$ for the four-acceleration $—$ the standard notation. Zangwill doesn’t cover any of this.

Generalizing the Spin to Four Dimensions

Following Jackson, section 11.11A, we define a four-vector $S^{α} = (S^{0}, \vec{S})$ in inertial frame $K$ by requiring that in the particle’s rest frame $K^{'}$ it is $(0, {\vec{s}}^{'}) .$ That is, in the particle rest frame the spin time component $S^{0}^{'} = 0.$

The four-spin components $S^{α}$ in other frames then follow from the usual vector Lorentz transformations.

Note that since the particle’s four-velocity in the rest frame $U^{α} = (c, 0, 0, 0)$ , and the inner product $U^{α} S_{α}$ is Lorentz invariant, $U^{α} S_{α} = 0$ in all frames.

*Four-Spin in Different Frames

This section (which follow Jackson) can be skipped at first reading.

In preparation for looking at the spin in different frames, Jackson next rewrites his earlier transformation equations (11.22) for an arbitrary four-vector $A^{α}$ (identical of course to the equations for the position coordinate vector). He uses the standard notation of denoting with $A_{∥}, {\vec{A}}_{⊥}$ the three-vector components parallel and perpendicular to the relative velocity $\vec{β} c$ of the two frames. The equations can then be written (his 11.22)

$\begin{matrix} {A^{'}}_{0} = γ (A_{0} - β A_{∥}), \\ {A^{'}}_{∥} = γ (A_{∥} - β A_{0}), \\ {\vec{A^{'}}}_{⊥} = {\vec{A}}_{⊥} . \end{matrix}$

Exercise: Check this is correct for the relative velocity along the $z$ -axis.

Translating this to the spin components in the two frames, in the spin rest-frame $K^{'}$ the four-vector

$({S^{'}}_{0}, {\vec{s}}^{'}) = (0, {\vec{s}}^{'}) \equiv ({A^{'}}_{0}, {\vec{A}}^{'}),$

so the first (time component) spin equation (transcribing from the ${A^{'}}_{0}$ equation above) is

$S_{0}^{'} = γ (S_{0} - \vec{β} \cdot \vec{S}),$

and since $S_{0}^{'} = 0$ (by definition),

$S_{0} = \vec{β} \cdot \vec{S} .$

Putting this in the second equation gives

${\vec{s}}_{∥}^{'} = γ ({\vec{S}}_{∥} - β S_{0}),$

${\vec{s}}^{'} \cdot \vec{β} = γ (\vec{S} \cdot \vec{β} - β^{2} (\vec{S} \cdot \vec{β})) .$

(Notation: Unlike Jackson, we’ve put a prime on ${\vec{s}}^{'}$ to remind ourselves it’s in the spin rest frame $K^{'} .$ )

In fact, from the third equation, the components of ${\vec{s}}^{'}, \vec{S}$ are identical except in the $\vec{β}$ (longitudinal) direction, so it follows from the above that (Jackson 11.158)

${\vec{s}}^{'} = \vec{S} - (\frac{γ}{γ + 1}) (\vec{β} \cdot \vec{S}) \vec{β} .$

Exercise: Prove it.

The inverse expressions are (all from Jackson)

$\begin{matrix} \vec{S} = {\vec{s}}^{'} + \frac{γ^{2}}{γ + 1} (\vec{β} \cdot {\vec{s}}^{'}) \vec{β}, \\ S_{0} = γ \vec{β} \cdot {\vec{s}}^{'} . \end{matrix}$

Relativistic Spin Precession: the BMT Equation

We’re now ready to construct a Lorentz invariant equation for the (proper) time development of the four-spin, $d S^{α} / d τ .$ Here we’ll follow Landau’s (only slightly different) presentation (page 153, QED Vol 4, 2^nd ed). The expression must be linear and homogeneous in $F^{α β}$ and $S^{α},$ and the only other variable is $U^{α} .$ (Jackson further includes $d U^{α} / d τ,$ but that is $(e / m) F^{α β} U_{β}$ , so adds nothing new).

With these requirements, the equation must be

$\frac{d S^{α}}{d τ} = A F^{α β} S_{β} + B U^{α} F^{β γ} U_{β} S_{γ},$

with $A, B$ constant coefficients. Landau remarks that with $U_{α} S^{α} = 0,$ and $F^{α β}$ antisymmetric so $F^{α β} U_{α} U_{β} = 0,$ it’s not difficult to check that there can be no other linear homogeneous terms (try writing one).

In the nonrelativistic limit, from our definition of the four-dimensional spin (that it’s $(0, {\vec{s}}^{'})$ in the rest frame) the left-hand side will only have spatial components, and in this limit the second term on the right hand side won’t contribute, since $U^{α} \to (c, 0, 0, 0),$ (and $U_{α} \to (- c, 0, 0, 0)$ with our metric -+++).

We already know the equation must become

${\dot{s}}_{i} = \frac{2 μ}{ℏ} {(\vec{s} \times \vec{B})}_{i},$

so that fixes the first unknown constant $A :$

$A = \frac{2 μ}{ℏ} = \frac{g e}{2 m} .$

The remaining task is to find the constant $B .$

The trick is to differentiate $U^{α} S_{α} = 0$ :

$\frac{d}{d τ} (U_{α} S^{α}) = S^{α} \frac{d U_{α}}{d τ} + U_{α} \frac{d S^{α}}{d τ} = 0$

and then insert the equation of motion

$m \frac{d U^{α}}{d τ} = e F^{α β} U_{β}$

to find

$U_{α} \frac{d S^{α}}{d τ} = - S_{α} \frac{d U^{α}}{d τ} = - S_{α} \frac{e}{m} F^{α β} U_{β} = \frac{e}{m} F^{α β} U_{α} S_{β}$

where we switch up-down index pairs to down-up, and used the antisymmetry of $F^{α β}$ in the last step.

Now, recalling $d S^{α} / d τ = A F^{α β} S_{β} + B U^{α} F^{β γ} U_{β} S_{γ},$ and multiplying both sides by $U_{α},$ remembering that $U^{α} U_{α} = - c^{2}$

$\frac{e}{m} F^{α β} U_{α} S_{β} = A F^{α β} U_{α} S_{β} - c^{2} B F^{β γ} U_{β} S_{γ} .$

Notice that all indices are dummies, so all terms have identical structure, and recalling $A = g e / 2 m,$ we have

$\frac{e}{m} = \frac{g e}{2 m} - B c^{2},$

and

$B = \frac{e}{2 m c^{2}} (g - 2) .$

Now that we’ve found $A$ and $B,$ the relativistic spin precession equation

$\frac{d S^{α}}{d τ} = A F^{α β} S_{β} + B U^{α} F^{β γ} U_{β} S_{γ}$

can be written

$\frac{d S^{α}}{d τ} = \frac{g e}{2 m} F^{α β} S_{β} + \frac{e}{2 m c^{2}} (g - 2) U^{α} F^{β γ} U_{β} S_{γ} .$

(Jackson 11.164, he has an extra c in the denominator because he’s using gaussian units here.)

Comparing this with the equation of motion

$m \frac{d U^{α}}{d τ} = e F^{α β} U_{β}$

we see directly that for $g = 2$ the four-vectors for spin and velocity precess together, generalizing the nonrelativistic result that for $g = 2$ the angle between spin and velocity is constant. We can also see that for $g = 2.002...$ the angle will change only slowly, and the rate of change is a direct measure of $g - 2.$

The spin precession equation above is called the BMT equation, found by Bargmann, Michel and Telegdi in 1959 (but previously found by Thomas in 1927 and Frenkel in 1926).

The rest of this lecture, an outline of the relevant particle physics, is optional.

*Theoretical Calculation of Electron and Muon Magnetic Moments

As mentioned at the beginning of the lecture, the first naïve ideas about these moments, based on the misconception that the electron spin $\vec{s}$ was equivalent to some classical rotating charge distribution, predicted that $\vec{μ} = (g e / 2 m) \vec{s}$ with (wrongly) $g = 1.$ The first relativistic wave equation for a spin one-half particle was Dirac’s in 1928. It impressively gave $g = 2,$ only off by a part in a thousand or so $—$ but nevertheless off.

The big breakthrough came in 1949 with Schwinger’s quantum electrodynamic calculation that to leading order in perturbation theory

$\frac{g}{2} = 1 + \frac{α}{2 π},$

where the fine structure constant $α ≅ 1 / 137,$ so $g ≅ 2.00232.$

This result, with $g$ now correct to about a part in a million, gained Schwinger the Nobel prize, and is written on his gravestone.

In a later review paper, Schwinger attributed the achievement to “George Green and I” (as we mentioned earlier in the lecture on Green’s functions).

However, this is not the end of the story. The experimental results are much more precise, challenging theorists to compute higher orders of perturbation theory.

Just to give a taste of the quantum electrodynamics involved, in Feynman diagram language the electron (or muon) is represented as a solid line, the electromagnetic field is a photon $—$ but off its mass shell, meaning non-propagating. (For example, a static field is built of such photons). The interaction is called a “vertex” (see figure).

The first order correction, the $α / 2 π$ calculated by Schwinger, in this diagram language corresponds to the emission and subsequent reabsorption of a virtual photon bridging the magnetic interaction (as in the diagram on the right).

Many higher order quantum electrodynamic diagrams can be drawn. An important one turns out to be that of the “bridging” photon line (which is of course an electromagnetic field) polarizing the vacuum, creating virtual electron-positron pairs, or muon-antimuon pairs, as shown on the left here.

Including all the second-order QED diagrams (which means a lot of calculation, see Landau) gives for the magnetic moment of the electron

$μ_{e} = {\frac{e ℏ}{2 m}}_{e} (1 + \frac{α}{2 π} - 0.328 \frac{α^{2}}{π^{2}}) .$

Noting that the electron mass does not appear inside the bracket, one might conclude that the same second-order expression would work for the muon, just by changing the outside mass term. However, this is not quite right: the problem is the vacuum polarization loop. For the electron, this calculation includes the electron-positron loop, which is fine, because the muon-antimuon loop makes a much smaller contribution. However, in calculating the muon magnetic moment, this same expression includes the muon-antimuon loop (which contributes around 2% of the total), but does not include the electron-positron loop, which is more important since the lighter virtual particles are more easily excited.

Now including the electron-positron polarization loop, the muon magnetic moment to second order is

$μ_{μ} = \frac{e ℏ}{2 m_{μ} c} (1 + \frac{α}{2 π} + 0.76 \frac{α^{2}}{π^{2}}) .$

These results are often presented in terms of the parameter

$a = \frac{g - 2}{2} .$

The original calculation gives $a_{μ} = 0.00116141,$ the second-order term shown above brings this to $a_{μ} = 0.00116551.$ (Not great notation, but standard: watch out for $a, α$ !)

The muon-antimuon loop contributes of order $0.0000001.$ As we shall see, the experiment delivers greater accuracy than the numbers we have given so far $—$ the current experimental result is $a_{μ} = 0.001165920.$ But the higher order graphs in quantum electrodynamics (meaning interactions of photons, electrons, positrons, muons and antimuons) solved in perturbation theory are multiplied by powers of the fine structure constant, $1 / 137,$ and are not big enough to reproduce the measured result.

Enter the Standard Model

What are we missing? Well, we’ve ignored a lot of other particles. We found a significant contribution to the vacuum polarization term from the muon-antimuon loop, but the (charged) pion has a mass not much greater that the muon, so surely there has to be a pion-antipion loop too? But it’s not that simple. The pion is strongly interacting, you can’t treat the strong interactions with perturbation theory, so the loop will be a strongly interacting mess, we represent it by a blob in the fourth diagram (as do others). However, this is what makes it interesting. We have a very exact result from experiment, so this is a good test of various theoretical attempt to analyze the strong interactions quantitatively. In particular, recent results using lattice gauge theory look good.

(Another approach we mention only for experts: the imaginary part of the polarization amplitude is given by all the hadronic blob components being on their mass shell, i.e. real, not virtual, particles. And, thanks to the magic of Feynman diagrams, this same graph represents an electron meeting a positron and annihilating into a photon which generates a spray of hadrons, as depicted here. This process occurs and these amplitudes have been measured. Then, by analytic continuation we can (with effort!) get to the hadronic contribution to the muon magnetic moment.)

This new insight into the murky world of strong interactions is a major reason the $g - 2$ experiment is worth doing.

The Experiment

If a muon is injected into a uniform magnetic field, in a direction perpendicular to the field, it will follow a circular path. If the injected muon has its spin aligned with its direction of motion (“polarized”), then, from the discussion above, after a complete circle the velocity vector will have rotated through 360 degrees, for $g = 2$ the spin would have done the same, but with $g - 2$ of order 0.001, the spin will be at a small angle (about a third of a degree) to the direction of motion, and this deviation increases linearly with the number of orbits. When the muon decays (its lifetime is about 2 microseconds) it emits an electron in the direction its spin was pointing, so by detecting this electron the accumulated change of spin direction can be found, and thus $g - 2$ measured.

In the current experiment at Fermilab, polarized muons having energy about 3.1 GeV are injected into a storage ring of diameter fifty feet in a perpendicular uniform magnetic field of strength 1.45 Tesla. Since the muons are traveling essentially at the speed of light (one foot per nanosecond!) one circuit takes about 0.15 microseconds.

The muon mass is around $100 MeV / c^{2},$ so the relativistic $γ ≃ 30.$ The muon lifetime is 2 microseconds in its rest frame, so the lab lifetime is around 60 milliseconds, in that time it would go around thousands of times, there will evidently be large total precession angles.

Further technical info (from Stefan Baessler): More precisely, the experiment is run at $γ = 29.3,$ referred to as the magic gamma, because it turns out that the effect of the motional magnetic field from the focussing electric quadrupoles on the precession vanishes at that value.

History of the Experiment

Measuring the muon magnetic moment to this accuracy is the modern equivalent of building a medieval cathedral. The experiment began in 1959, and will not end any time soon. Many thousands of highly skilled workers have spent years on it, and the cost runs to billions of dollars.

The first project was at CERN, and in the early sixties confirmed Schwinger’s result. The second stage began at Brookhaven Lab, on Long Island, in 1989, a factor of 20 more precise than the CERN experiment. This involved the muon storage ring mentioned above. Finally (possibly!) in 2013 the experiment was moved to Fermilab. Transporting the fifty-foot diameter ring proved challenging: it could not be disassembled. The simplest route from Long Island, NY, to Chicago for such a load turned out to include going by sea around Florida $—$ the details can be found here.

previous index next <

71. Motion of a Spin One-Half in a Slowly-Varying Magnetic Field: The Key to Measuring the Muon Moment (g – 2)