72. Dynamics of a Relativistic Particle in an Electromagnetic Field

Jackson Chapter 12.1: I’ve used Gaussian units in this lecture, to connect better with Jackson.

Michael Fowler UVa

The Facts

It is well-established experimentally that the equations of motion for a charged particle in an electromagnetic field are (following Jackson, so Gaussian (cgs) units here, hence the $c$ )

$\begin{array}{l} \frac{d \vec{p}}{d t} = e (\vec{E} + \frac{\vec{v}}{c} \times \vec{B}), \\ \frac{d E}{d t} = e \vec{v} \cdot \vec{E} . \end{array}$

These can be written in standard relativistic notation

$\frac{d U^{α}}{d τ} = \frac{e}{m c} F^{α β} U_{β}$

where $τ$ is the proper time, $d t = γ d τ, γ = 1 / \sqrt{1 - v^{2} / c^{2}},$ $U^{α}$ is the four-velocity $U^{α} = (γ c, γ \vec{v}) = p^{α} / m,$ and $F^{α β}$ is the field tensor (for its definition see lecture 65).

Exercise: The equivalence of these two sets of equations was given as an exercise at the end of lecture 65. If you didn’t do it then, it would be a good idea to do it now.
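A quick numerical sanity check of this equivalence (not a substitute for doing the exercise) can be sketched as follows. The layout of $F^{α β}$ used here is Jackson's, which we are assuming matches the definition in lecture 65; the function names and the sample field values are purely illustrative.

```python
import math

C = 2.99792458e10  # speed of light in cm/s (Gaussian units)


def field_tensor(E, B):
    """Contravariant F^{alpha beta}; Jackson's sign convention is ASSUMED."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, -Bz, By],
        [Ey, Bz, 0.0, -Bx],
        [Ez, -By, Bx, 0.0],
    ]


def covariant_rates(e, v, E, B):
    """Evaluate (e/c) F^{alpha beta} U_beta and convert it to d/dt rates.

    Returns (dp/dt as a 3-list, d(energy)/dt) implied by the covariant form.
    """
    gamma = 1.0 / math.sqrt(1.0 - sum(vi * vi for vi in v) / C**2)
    # Lower the index with metric (+,-,-,-): U_beta = (gamma c, -gamma v)
    U_lower = [gamma * C] + [-gamma * vi for vi in v]
    F = field_tensor(E, B)
    rhs = [(e / C) * sum(F[a][b] * U_lower[b] for b in range(4)) for a in range(4)]
    # m dU^i/dtau = gamma dp^i/dt  and  m dU^0/dtau = (gamma/c) d(energy)/dt
    dp_dt = [rhs[i] / gamma for i in (1, 2, 3)]
    power = C * rhs[0] / gamma
    return dp_dt, power


def three_vector_rates(e, v, E, B):
    """Right-hand sides of the two three-vector equations of motion."""
    cross = [
        v[1] * B[2] - v[2] * B[1],
        v[2] * B[0] - v[0] * B[2],
        v[0] * B[1] - v[1] * B[0],
    ]
    dp_dt = [e * (E[i] + cross[i] / C) for i in range(3)]
    power = e * sum(v[i] * E[i] for i in range(3))
    return dp_dt, power
```

Calling both functions with the same charge, velocity and fields should give matching force and power to within rounding error, provided the assumed tensor layout is the one in use.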

The Theory: The Principle of Least Action

We first briefly review the formalism for a general dynamical system, degrees of freedom labeled by coordinates $q_{i} .$ The state at a given moment is specified completely by the set of positions and velocities $(q_{i}, {\dot{q}}_{i}),$ a point in configuration space. The system’s evolution in time, determined by Lagrange’s equations, traces a path in this configuration space from a given initial state to a final state.

The action corresponding to this path is the integral

$S = \int_{t_{1}}^{t_{2}} L (q_{i}, {\dot{q}}_{i}) d t$

where the usual Lagrangian is $L (q_{i}, {\dot{q}}_{i}) = T - V,$ the difference of kinetic and potential energies (but we’ll find a rather different expression for a charged particle in a magnetic field, see below).

The actual physical path in configuration space followed by the evolving system has the least action of all the possible paths between the given initial and final states. Minimization of the action using the calculus of variations, i.e. varying the path a little, yields Lagrange’s equations of motion.

This is standard classical mechanics, but here we want to include relativistic motion of a particle in a field. As we’ve discussed elsewhere, the Principle of Least Action comes ultimately from the wave nature of matter, an argument that naturally extends to the relativistic regime. We just have to figure out what relativistic action yields the dynamics we already know from experiment.

A Free Particle

We start with a free particle and no potential. The action cannot depend on the coordinate system chosen: it must be Lorentz invariant. But all we have is a point particle and a path, the action being an integration over path increments. There is just one Lorentz invariant for a path increment: its interval, meaning its proper incremental time $d τ .$ The obvious candidate for the action, then, has to be

$S = \int_{t_{1}}^{t_{2}} L d t = - α \int_{a}^{b} c d τ,$

that is, a multiple of the particle’s (Lorentz invariant) proper time, with $α$ an as yet undetermined constant. Why have we put in a minus sign? Because we know moving clocks run slow, so if we take all possible paths beginning at the origin, then ending at the origin one minute later (as measured by a clock stationed at the origin), the trivial path of just staying at the origin takes the longest measured elapsed time (any other path needs some movement, and hence clock slowing). The motionless path maximizes the proper time, so with the minus sign it minimizes the action, and, of course, it satisfies the equation of motion in zero field.
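The clock argument can be illustrated numerically. The sketch below uses a hypothetical family of out-and-back paths $x (t) = a \sin (π t / T),$ with units in which $c = 1$ (so lengths are in light-seconds) and $T = 60$ seconds; the path family, the function name and the step count are our own choices, not anything from Jackson.

```python
import math


def proper_time(amplitude, total_time=60.0, steps=10000, c=1.0):
    """Proper time elapsed along x(t) = amplitude * sin(pi t / T), 0 <= t <= T.

    The path leaves the origin at t = 0 and returns at t = T.
    Integrates d(tau) = sqrt(1 - v^2/c^2) dt with the midpoint rule.
    """
    dt = total_time / steps
    tau = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        v = amplitude * (math.pi / total_time) * math.cos(math.pi * t / total_time)
        if abs(v) >= c:
            raise ValueError("path is not timelike: speed reaches c")
        tau += math.sqrt(1.0 - (v / c) ** 2) * dt
    return tau
```

For zero amplitude the result is the full 60 seconds, and any nonzero amplitude in this family gives a smaller elapsed proper time, so the action $- α c \int d τ$ is smallest for the motionless path.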

We can determine the overall constant $α$ in the action by matching to the known result in the nonrelativistic limit, where the Lagrangian $L = \frac{1}{2} m v^{2} .$

Putting $d t = γ d τ,$ from the above action integral $L d t = - α c d τ,$ so

$L = - α c \frac{d τ}{d t} = - α c \sqrt{1 - \frac{v^{2}}{c^{2}}} \approx - α c + \frac{α v^{2}}{2 c} .$

The constant term is irrelevant to the minimization, and matching the second term to $\frac{1}{2} m v^{2}$ gives $α = m c,$ so the action is

$S = - m c^{2} \int_{a}^{b} d τ,$

and

$L = - m c^{2} \sqrt{1 - \frac{v^{2}}{c^{2}}} .$

Having found the Lagrangian, we can derive momentum and energy in the standard fashion:

The momentum

$\vec{p} = \frac{\partial L}{\partial \vec{v}} = \frac{m \vec{v}}{\sqrt{1 - v^{2} / c^{2}}},$

and the energy (the Hamiltonian)

$E = \vec{p} \cdot \vec{v} - L = \frac{m c^{2}}{\sqrt{1 - v^{2} / c^{2}}} .$
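As a check on these two results, one can differentiate the Lagrangian numerically. This is a minimal sketch, assuming units with $c = 1$ by default and a central-difference step `h` that we picked arbitrarily; all the function names are illustrative.

```python
import math


def lagrangian(v, m=1.0, c=1.0):
    """Free-particle Lagrangian L = -m c^2 sqrt(1 - v^2/c^2)."""
    v2 = sum(vi * vi for vi in v)
    return -m * c**2 * math.sqrt(1.0 - v2 / c**2)


def momentum_from_L(v, m=1.0, c=1.0, h=1e-6):
    """p_i = dL/dv_i, estimated by central differences with step h."""
    p = []
    for i in range(3):
        up, down = list(v), list(v)
        up[i] += h
        down[i] -= h
        p.append((lagrangian(up, m, c) - lagrangian(down, m, c)) / (2.0 * h))
    return p


def energy_from_L(v, m=1.0, c=1.0):
    """E = p.v - L, using the numerically differentiated momentum."""
    p = momentum_from_L(v, m, c)
    return sum(p[i] * v[i] for i in range(3)) - lagrangian(v, m, c)


def gamma(v, c=1.0):
    """Lorentz factor 1 / sqrt(1 - v^2/c^2)."""
    return 1.0 / math.sqrt(1.0 - sum(vi * vi for vi in v) / c**2)
```

The numerical momentum should agree with $γ m \vec{v}$ and the numerical energy with $γ m c^{2},$ while at small speeds `lagrangian(v) + m * c**2` should approach $\frac{1}{2} m v^{2}.$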

*Making Coordinates Explicit

The idea here (Jackson 12.1.B) is to escape the nonrelativistic perspective: we’ll treat time as just another coordinate, like the position coordinates, and introduce a further parameter $s,$ which labels position along the particle’s path in spacetime, $x^{μ} (s),$ and is strictly increasing. (Of course, labeling the path using proper time, as in the previous section, also treats ordinary time and space equivalently in the relativistic context, so the present section doesn’t really add much further insight; we’ve put it in to connect better with Jackson.)

Notation warning: we are following Jackson (p 584, 3rd edition) in introducing this parameter $s .$ Unfortunately, Landau (Vol 2, Ch 2) uses $s$ for the proper time (as do some others), which we label $τ .$

The usual nonrelativistic derivation of the equations of motion of a system by minimizing action takes an arbitrary small coordinate deviation $δ (x_{i})$ from the path and requires that the action not change to first order. We’ll do the same, but obviously to proceed it is necessary that the $x^{μ}$ appear in the Lagrangian. With our limited options, that means $d τ$ must be written as a function of the $d x^{μ}'s .$

Now $c d τ$ is the incremental invariant interval (and to make the equations coincide with Jackson’s, here we’ll take metric (+ - - -) )

$c d τ = \sqrt{g^{α β} d x_{α} d x_{β}},$

so the action, bringing in our path label parameter $s,$ is

$S = - m c^{2} \int_{s_{1}}^{s_{2}} \frac{d τ}{d s} d s = - m c \int_{s_{1}}^{s_{2}} \sqrt{g^{α β} \frac{d x_{α}}{d s} \frac{d x_{β}}{d s}} d s .$

The integrand plays the role of a Lagrangian, with “positions” $x_{α}$ and “velocities” $d x_{α} / d s .$ Following the exact variational procedure that gives Lagrange’s equations we find

$m c \frac{d}{d s} [\frac{d x^{α} / d s}{\sqrt{\frac{d x_{β}}{d s} \frac{d x^{β}}{d s}}}] = 0.$

At this point we note that since $s$ is a strictly increasing smooth function of $τ,$ the expression in the square brackets is equal to the same with $d s$ replaced by $d τ$ everywhere:

$[\frac{d x^{α} / d s}{\sqrt{\frac{d x_{β}}{d s} \frac{d x^{β}}{d s}}}] = [\frac{d x^{α} / d τ}{\sqrt{\frac{d x_{β}}{d τ} \frac{d x^{β}}{d τ}}}] .$

In the second expression, the denominator is the (constant) norm of the four-velocity, $\sqrt{U_{β} U^{β}},$ so the variational equation becomes

$m c \frac{d}{d s} [\frac{d x^{α} / d τ}{c}] = 0.$

That is, the bracketed expression is constant as a function of $s,$ therefore also as a function of $τ,$ and we conclude that

$m \frac{d^{2} x^{α}}{d τ^{2}} = 0.$

Again we’ve proved that the physical path between two points in spacetime (with no potentials present) is one at constant velocity, that being the path of maximum proper time. Surprise.

Note: Comparing the path in four-dimensional spacetime labeled by the parameter $s (x_{α})$ with the path of a nonrelativistic particle in three-dimensional space labeled with time $t (\vec{r}),$ it looks as if we have an extra degree of freedom. But we don’t. The path label function $s$ is arbitrary, provided only that it is increasing in proper time: all such functions give identical physics. There isn’t really an extra degree of freedom; it’s more like choosing a different gauge.
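The independence of the labeling can be seen numerically. In this sketch the worldline and the three label functions are hypothetical examples of our own, in units with $c = 1$ and one space dimension; the integral $\int \sqrt{g^{α β} (d x_{α} / d s) (d x_{β} / d s)} d s$ is approximated by summing the invariant intervals of short chords between successive labeled points.

```python
import math


def worldline(t):
    """Hypothetical timelike path in c = 1 units: the event (ct, x) at time t."""
    return (t, 0.1 * math.sin(2.0 * math.pi * t))


def interval_length(label_to_time, steps=20000):
    """Approximate the path integral of the invariant interval over s in [0, 1].

    label_to_time maps the label s to coordinate time t; it must be strictly
    increasing with label_to_time(0) = 0 and label_to_time(1) = 1.
    """
    total = 0.0
    previous = worldline(label_to_time(0.0))
    for k in range(1, steps + 1):
        current = worldline(label_to_time(k / steps))
        d0 = current[0] - previous[0]
        d1 = current[1] - previous[1]
        total += math.sqrt(d0 * d0 - d1 * d1)  # metric (+, -)
        previous = current
    return total


def linear_label(s):
    return s


def cubic_label(s):
    return 0.5 * (s + s**3)


def exponential_label(s):
    return math.expm1(2.0 * s) / math.expm1(2.0)
```

All three labelings should return the same value (the elapsed proper time, which is less than 1 for this wiggling path) up to discretization error, which is the sense in which the choice of $s$ carries no physics.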

Particle in an Electromagnetic Field

Recall that for a free particle, we derived the motion from an action integral along the path, and the only Lorentz-invariant term for an increment of path was the proper time increment (dimensionally adjusted) $m c^{2} d τ .$ Introducing an electromagnetic field, with four potential $A^{μ} = (φ, \vec{A}),$ a possible action is

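$S = \int_{a}^{b} L d t = \int_{a}^{b} (- m c^{2} d τ - \frac{e}{c} A_{μ} d x^{μ}) .$

With $d τ = \sqrt{1 - v^{2} / c^{2}} d t$ and, in the metric (+ - - -), $A_{μ} d x^{μ} = (c φ - \vec{A} \cdot \vec{v}) d t,$ the Lagrangian is

$L = - m c^{2} \sqrt{1 - v^{2} / c^{2}} + e \frac{\vec{v}}{c} \cdot \vec{A} (\vec{r}, t) - e φ (\vec{r}, t) .$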

This is confirmed experimentally: that is, it leads to the observed Lorentz force law, $\vec{F} = e (\vec{E} + \frac{\vec{v}}{c} \times \vec{B}),$ as we’ll now demonstrate.

Canonical and Kinetic Momenta

The canonical momentum (using upper-case $\vec{P},$ following Landau and Jackson)

$\vec{P} = \frac{\partial L}{\partial \vec{v}} = \frac{m \vec{v}}{\sqrt{1 - v^{2} / c^{2}}} + \frac{e}{c} \vec{A} (\vec{r}, t) .$

We’ll reserve lower-case $\vec{p}$ for the kinetic momentum, meaning relativistic mass times velocity (the first term on the right-hand side) so the canonical momentum

$\vec{P} = \vec{p} + \frac{e}{c} \vec{A} (\vec{r}, t) .$

Writing this in terms of the spatial components of the four-velocity $U^{μ} = (γ c, γ \vec{v})$ and the four-potential:

$P^{i} = m U^{i} + \frac{e}{c} A^{i}, i = 1, 2, 3.$

The energy (the Hamiltonian)

$E = \vec{P} \cdot \vec{v} - L = \frac{m c^{2}}{\sqrt{1 - v^{2} / c^{2}}} + e φ (\vec{r}, t) .$

This can also be written $P^{0} = m U^{0} + (e / c) A^{0},$ (remember $A^{0} = φ, P^{0} = E / c,$ Jackson p 582) matching the expression for the spatial components $P^{i} = m U^{i} + \frac{e}{c} A^{i},$ to give the four-vector

$P^{μ} = m U^{μ} + \frac{e}{c} A^{μ} .$

Note the energy $E = c P^{0}$ is just the relativistic mass energy $γ m c^{2}$ plus the electrostatic potential energy. The magnetic field does no work on the particle, so doesn’t appear in the energy. This is just the particle’s Hamiltonian expressed as a function of position and velocity. But if you want to use this Hamiltonian to derive Hamilton’s equations of motion, it must be written in terms of the canonical momenta, and then the vector potential reappears, as it obviously must, since the magnetic field affects the motion.

The equation of motion from Lagrange’s equations:

$\frac{d}{d t} (\frac{\partial L}{\partial \vec{v}}) = \frac{\partial L}{\partial \vec{r}},$

with $L = - m c^{2} \sqrt{1 - v^{2} / c^{2}} + e \vec{v} \cdot \frac{\vec{A} (\vec{r}, t)}{c} - e φ (\vec{r}, t),$

is (following Landau)

$\frac{d}{d t} (\vec{p} + \frac{e}{c} \vec{A} (\vec{r}, t)) = \frac{e}{c} \vec{\nabla} (\vec{A} \cdot \vec{v}) - e \vec{\nabla} φ .$

The tricky point here is that the vector potential $\vec{A} (\vec{r}, t)$ is (of course) at the position of the particle, so differentiating it with respect to time must also track the particle’s position, that is,

$\frac{d \vec{A} (\vec{r}, t)}{d t} = \frac{\partial \vec{A} (\vec{r}, t)}{\partial t} + (\vec{v} \cdot \vec{\nabla}) \vec{A} (\vec{r}, t),$

sometimes called the convective derivative.

The equation of motion becomes

$\frac{d \vec{p}}{d t} = \frac{e}{c} \vec{\nabla} (\vec{A} \cdot \vec{v}) - \frac{e}{c} (\vec{v} \cdot \vec{\nabla}) \vec{A} (\vec{r}, t) - e \vec{\nabla} φ - \frac{e}{c} \frac{\partial \vec{A} (\vec{r}, t)}{\partial t} .$

Here the $\vec{\nabla}$ is from the $\partial / \partial \vec{r}$ in Lagrange’s equations, where $\vec{r}, \vec{v}$ are independent variables, so $\vec{\nabla}$ doesn’t operate on $\vec{v},$ only on $\vec{A},$ and from a well-known vector identity

$\vec{\nabla} (\vec{A} \cdot \vec{v}) - (\vec{v} \cdot \vec{\nabla}) \vec{A} = \vec{v} \times (\vec{\nabla} \times \vec{A}) = \vec{v} \times \vec{B}$

and, since $\vec{E} = - \vec{\nabla} φ - \frac{1}{c} \frac{\partial \vec{A}}{\partial t},$ we have

$\frac{d \vec{p}}{d t} = e (\vec{E} + \frac{\vec{v} \times \vec{B}}{c}),$

as observed experimentally, and therefore confirmation of our conjectured action for a charged particle in an electromagnetic field.
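The step from Lagrange’s equation to the Lorentz force can be checked numerically for any smooth potentials. The sketch below uses made-up potentials `phi` and `vec_A`, an arbitrary value of $c$ (so that misplaced factors of $c$ would show up), and central differences with a step we chose by hand; none of these come from Landau or Jackson.

```python
import math


def phi(r, t):
    """Hypothetical scalar potential, chosen only to be smooth."""
    x, y, z = r
    return x * y + z * z * math.cos(t)


def vec_A(r, t):
    """Hypothetical vector potential, chosen only to be smooth."""
    x, y, z = r
    return [y * z * math.sin(t), x * x + t * z, x * y * z]


def derivatives(r, t, h=1e-5):
    """Central differences: grad(phi), dA/dt at fixed r, J[i][j] = dA_j/dr_i."""
    grad_phi, J = [], []
    for i in range(3):
        up, down = list(r), list(r)
        up[i] += h
        down[i] -= h
        grad_phi.append((phi(up, t) - phi(down, t)) / (2 * h))
        A_up, A_down = vec_A(up, t), vec_A(down, t)
        J.append([(A_up[j] - A_down[j]) / (2 * h) for j in range(3)])
    A_later, A_earlier = vec_A(r, t + h), vec_A(r, t - h)
    dA_dt = [(A_later[j] - A_earlier[j]) / (2 * h) for j in range(3)]
    return grad_phi, dA_dt, J


def lagrange_rhs(e, c, r, v, t):
    """dp/dt from Lagrange's equation, before using the vector identity."""
    grad_phi, dA_dt, J = derivatives(r, t)
    out = []
    for i in range(3):
        grad_Av = sum(J[i][j] * v[j] for j in range(3))  # grad_i(A.v), v fixed
        v_grad_A = sum(v[j] * J[j][i] for j in range(3))  # (v.grad) A_i
        out.append((e / c) * (grad_Av - v_grad_A) - e * grad_phi[i]
                   - (e / c) * dA_dt[i])
    return out


def lorentz_force(e, c, r, v, t):
    """e (E + v x B / c), with E and B built from the same potentials."""
    grad_phi, dA_dt, J = derivatives(r, t)
    E = [-grad_phi[i] - dA_dt[i] / c for i in range(3)]
    B = [J[1][2] - J[2][1], J[2][0] - J[0][2], J[0][1] - J[1][0]]  # curl A
    cross = [
        v[1] * B[2] - v[2] * B[1],
        v[2] * B[0] - v[0] * B[2],
        v[0] * B[1] - v[1] * B[0],
    ]
    return [e * (E[i] + cross[i] / c) for i in range(3)]
```

Because both functions use the same finite-difference derivatives, their agreement tests the vector identity and the bookkeeping of signs and factors of $c,$ not the accuracy of the differentiation.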

Equation of Motion Directly from Least Action

(following Landau, p 64, his $d s = c d τ .$ It’s worth doing the minimization again, staying now in four dimensions, to see how the field tensor emerges naturally.)

Now

$δ S = δ \int_{a}^{b} (- m c^{2} d τ - \frac{e}{c} A_{μ} d x^{μ}) = 0.$

That is,

$δ S = - \int_{a}^{b} (m \frac{d x_{μ} d δ x^{μ}}{d τ} + \frac{e}{c} A_{μ} d δ x^{μ} + \frac{e}{c} δ A_{μ} d x^{μ}) = 0,$

where the first term uses $c d τ = \sqrt{d x_{μ} d x^{μ}},$ so that $δ (d τ) = d x_{μ} d δ x^{μ} / (c^{2} d τ) .$

Integrating the first two terms by parts, writing $d x_{μ} / d τ = U_{μ}$ and requiring $δ x^{μ} (a) = δ x^{μ} (b) = 0,$

$\int_{a}^{b} (m d U_{μ} δ x^{μ} + \frac{e}{c} d A_{μ} δ x^{μ} - \frac{e}{c} δ A_{μ} d x^{μ}) = 0.$

Now

$δ A_{μ} = \frac{\partial A_{μ}}{\partial x^{ν}} δ x^{ν}, d A_{μ} = \frac{\partial A_{μ}}{\partial x^{ν}} d x^{ν},$

giving

$\int_{a}^{b} (m d U_{μ} δ x^{μ} + \frac{e}{c} \frac{\partial A_{μ}}{\partial x^{ν}} δ x^{μ} d x^{ν} - \frac{e}{c} \frac{\partial A_{μ}}{\partial x^{ν}} d x^{μ} δ x^{ν}) = 0.$

Putting the infinitesimals in terms of the four-velocity and proper time,

$d x^{μ} = U^{μ} d τ, d U_{μ} = (d U_{μ} / d τ) d τ,$

we find (switching dummy suffixes in the third term)

$\int_{a}^{b} (m \frac{d U_{μ}}{d τ} - \frac{e}{c} (\frac{\partial A_{ν}}{\partial x^{μ}} - \frac{\partial A_{μ}}{\partial x^{ν}}) U^{ν}) δ x^{μ} d τ = 0.$

That is, in terms of the electromagnetic field tensor $F_{μ ν} = \partial A_{ν} / \partial x^{μ} - \partial A_{μ} / \partial x^{ν},$ and raising the index,

$m \frac{d U^{μ}}{d τ} = \frac{e}{c} F^{μ ν} U_{ν} .$

This is of course just the equation previously derived, now written in four-dimensional form.
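One consequence worth checking numerically is that the antisymmetry of $F^{μ ν}$ keeps the norm $U^{μ} U_{μ}$ fixed at $c^{2}$ along the path. The following is a rough sketch that steps the four-dimensional equation forward in proper time with a standard fourth-order Runge-Kutta step, for constant fields; the tensor layout is Jackson's (assumed), and the units ($e = m = c = 1$ by default), step size and field values are arbitrary illustrative choices.

```python
import math


def field_tensor(E, B):
    """Contravariant F^{mu nu}; Jackson's sign convention is ASSUMED."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, -Bz, By],
        [Ey, Bz, 0.0, -Bx],
        [Ez, -By, Bx, 0.0],
    ]


def four_acceleration(U, F, e, m, c):
    """dU^mu/dtau = (e / (m c)) F^{mu nu} U_nu, with metric (+,-,-,-)."""
    U_lower = [U[0], -U[1], -U[2], -U[3]]
    return [(e / (m * c)) * sum(F[a][b] * U_lower[b] for b in range(4))
            for a in range(4)]


def norm2(U):
    """Minkowski square U^mu U_mu."""
    return U[0] ** 2 - U[1] ** 2 - U[2] ** 2 - U[3] ** 2


def rk4_step(U, F, e, m, c, dtau):
    """One classical Runge-Kutta step of size dtau in proper time."""
    def shifted(k, scale):
        return [U[i] + scale * k[i] for i in range(4)]

    k1 = four_acceleration(U, F, e, m, c)
    k2 = four_acceleration(shifted(k1, dtau / 2), F, e, m, c)
    k3 = four_acceleration(shifted(k2, dtau / 2), F, e, m, c)
    k4 = four_acceleration(shifted(k3, dtau), F, e, m, c)
    return [U[i] + dtau / 6 * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i])
            for i in range(4)]


def evolve(v0, E, B, e=1.0, m=1.0, c=1.0, dtau=1e-3, steps=1000):
    """Four-velocity after steps * dtau of proper time in constant fields."""
    gamma = 1.0 / math.sqrt(1.0 - sum(vi * vi for vi in v0) / c**2)
    U = [gamma * c] + [gamma * vi for vi in v0]
    F = field_tensor(E, B)
    for _ in range(steps):
        U = rk4_step(U, F, e, m, c, dtau)
    return U
```

With the default units, `norm2(evolve(...))` should stay at 1 to within integration error, and the contraction of the four-acceleration with $U_{μ}$ should vanish, so only three of the four component equations are independent.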

What’s the Hamiltonian?

Jackson (page 585) introduces a Hamiltonian $\hat{H} = P_{μ} U^{μ} + L,$ which is Lorentz invariant (following Barut). This is not standard practice: the Hamiltonian is conventionally thought of as the time component of an energy-momentum four-vector, as in Landau (page 49), and that will be our approach in these notes. We will not be discussing Lorentz invariant Hamiltonians.

*An Exercise: Checking Hamilton’s Equations

We’ll check that (using the standard non-Lorentz-invariant Hamiltonian) Hamilton’s approach yields the correct equation of motion:

$H = \vec{P} \cdot \vec{v} - L = \frac{m c^{2}}{\sqrt{1 - v^{2} / c^{2}}} + e φ (\vec{r}, t) .$

(From $L = - m c^{2} \sqrt{1 - v^{2} / c^{2}} + e \vec{v} \cdot \frac{\vec{A} (\vec{r}, t)}{c} - e φ (\vec{r}, t),$ $\vec{P} = m \frac{\vec{v}}{\sqrt{1 - v^{2} / c^{2}}} + \frac{e}{c} \vec{A} .$ )

This must be expressed in the canonical variables, in other words using $\vec{P},$ but not $\vec{v},$ leaving $\vec{r}$ the same. Recall $P^{μ} = m U^{μ} + \frac{e}{c} A^{μ}$ and in particular $P^{0} = m U^{0} + \frac{e}{c} A^{0}$ where $A^{0} = φ, P^{0} = E / c = H / c .$

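The four-velocity $U^{μ} = (γ c, γ \vec{v})$ normalizes to $U^{μ} U_{μ} = c^{2}$ in our metric (+ - - -), so $(P^{μ} - \frac{e}{c} A^{μ}) (P_{μ} - \frac{e}{c} A_{μ}) = m^{2} c^{2},$ or, multiplying by $c^{2},$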

${(H - e φ)}^{2} - c^{2} {(\vec{P} - \frac{e}{c} \vec{A})}^{2} = m^{2} c^{4},$

so the Hamiltonian

$H = c \sqrt{{(\vec{P} - e \vec{A} / c)}^{2} + m^{2} c^{2}} + e φ .$

Hamilton’s equations are $\dot{\vec{r}} = \partial H / \partial \vec{P}, \dot{\vec{P}} = - \partial H / \partial \vec{r} .$

That is, first,

$\dot{\vec{r}} = \frac{c (\vec{P} - e \vec{A} / c)}{\sqrt{{(\vec{P} - e \vec{A} / c)}^{2} + m^{2} c^{2}}} = \frac{c \vec{p}}{\sqrt{{\vec{p}}^{2} + m^{2} c^{2}}} = \frac{c^{2} \vec{p}}{E} .$

(Recall $E^{2} = m^{2} c^{4} + c^{2} {\vec{p}}^{2} .$ )

Notice that in the nonrelativistic limit the denominator becomes $m c^{2},$ the numerator is $c^{2} \vec{p},$ so the equation is just $\dot{\vec{r}} = (\vec{p} / m) = \vec{v} .$ Away from this limit, of course, the kinetic momentum is $\vec{p} = m \vec{v} / \sqrt{1 - {(v / c)}^{2}},$ the total energy $E = m c^{2} / \sqrt{1 - {(v / c)}^{2}},$ so our equation can be written $\vec{p} = m_{rel} \vec{v}$ with $m_{rel} = m / \sqrt{1 - {(v / c)}^{2}} .$
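Here is a small numerical check of the first of Hamilton’s equations, and of the closed form for $H$ itself. It is only a sketch: the potentials are treated as given numbers at the particle’s position, the values of $e, m, c$ are arbitrary, and the function names and finite-difference step are our own.

```python
import math


def canonical_momentum(v, A, e, m, c):
    """P = gamma m v + (e/c) A for velocity v and vector potential A."""
    gamma = 1.0 / math.sqrt(1.0 - sum(vi * vi for vi in v) / c**2)
    return [gamma * m * v[i] + e * A[i] / c for i in range(3)]


def hamiltonian(P, A, phi, e, m, c):
    """H = c sqrt((P - eA/c)^2 + m^2 c^2) + e phi."""
    kinetic = [P[i] - e * A[i] / c for i in range(3)]
    return c * math.sqrt(sum(k * k for k in kinetic) + (m * c) ** 2) + e * phi


def velocity_from_H(P, A, phi, e, m, c, h=1e-6):
    """dH/dP_i by central differences; this should reproduce the velocity."""
    v = []
    for i in range(3):
        up, down = list(P), list(P)
        up[i] += h
        down[i] -= h
        v.append((hamiltonian(up, A, phi, e, m, c)
                  - hamiltonian(down, A, phi, e, m, c)) / (2.0 * h))
    return v
```

Starting from a chosen velocity, building $\vec{P}$ with `canonical_momentum` and then differentiating $H$ should return the same velocity, and $H$ should equal $γ m c^{2} + e φ .$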

The second Hamiltonian equation is $\dot{\vec{P}} = - \partial H / \partial \vec{r} .$

Now

$\dot{\vec{P}} = \dot{\vec{p}} + e \dot{\vec{A}} (\vec{r}) / c = \dot{\vec{p}} - e (\vec{E} + \vec{\nabla} φ) + e (\vec{v} \cdot \vec{\nabla}) \vec{A} / c$

using $\vec{E} = - \frac{1}{c} \frac{\partial \vec{A}}{\partial t} - \vec{\nabla} φ$ (the partial derivative being taken at a fixed point in space); the last term above is from the convective derivative, since the total time derivative $\dot{\vec{A}} (\vec{r})$ includes the variation from the particle moving. On the other side of the equation, differentiating

$H = c \sqrt{{(\vec{P} - e \vec{A} / c)}^{2} + m^{2} c^{2}} + e φ$

at fixed $\vec{P}$ gives

$\begin{matrix} - \frac{\partial H}{\partial r_{i}} = \frac{c {(\vec{P} - e \vec{A} / c)}_{j}}{\sqrt{{(\vec{P} - e \vec{A} / c)}^{2} + m^{2} c^{2}}} \frac{e}{c} \nabla_{i} A_{j} - e \nabla_{i} φ \\ = \frac{c p_{j}}{\sqrt{p^{2} + m^{2} c^{2}}} \frac{e}{c} \nabla_{i} A_{j} - e \nabla_{i} φ, \end{matrix}$

so Hamilton’s second equation $\dot{\vec{P}} = - \partial H / \partial \vec{r}$ is

$\dot{p}_{i} - e (E_{i} + \nabla_{i} φ) + \frac{e}{c} (\vec{v} \cdot \vec{\nabla}) A_{i} = \frac{c p_{j}}{\sqrt{p^{2} + m^{2} c^{2}}} \frac{e}{c} \nabla_{i} A_{j} - e \nabla_{i} φ .$

Now $\frac{c \vec{p}}{\sqrt{p^{2} + m^{2} c^{2}}} = \vec{v}$ (check this!) so

$\dot{p}_{i} - e E_{i} = e v_{j} \nabla_{i} A_{j} / c - e v_{j} \nabla_{j} A_{i} / c,$

that is, with $v_{j} \nabla_{i} A_{j} - v_{j} \nabla_{j} A_{i} = {(\vec{v} \times (\vec{\nabla} \times \vec{A}))}_{i} = {(\vec{v} \times \vec{B})}_{i},$ we find

$\dot{\vec{p}} = e \vec{E} + e \vec{v} \times \vec{B} / c .$
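The second of Hamilton’s equations can be checked the same way. In this sketch the potentials `phi` and `vec_A` are made-up smooth functions, the constants are arbitrary, and all derivatives are central differences with a hand-picked step; we compare $- \partial H / \partial \vec{r}$ at fixed $\vec{P}$ with the Lorentz force plus $(e / c)$ times the total time derivative of $\vec{A}$ along the path.

```python
import math


def phi(r, t):
    """Hypothetical scalar potential, chosen only to be smooth."""
    x, y, z = r
    return x * y + z * z * math.cos(t)


def vec_A(r, t):
    """Hypothetical vector potential, chosen only to be smooth."""
    x, y, z = r
    return [y * z * math.sin(t), x * x + t * z, x * y * z]


def hamiltonian(r, P, t, e, m, c):
    """H = c sqrt((P - eA/c)^2 + m^2 c^2) + e phi at position r, time t."""
    A = vec_A(r, t)
    kin2 = sum((P[i] - e * A[i] / c) ** 2 for i in range(3))
    return c * math.sqrt(kin2 + (m * c) ** 2) + e * phi(r, t)


def minus_dH_dr(r, P, t, e, m, c, h=1e-5):
    """-dH/dr_i at fixed canonical momentum P, by central differences."""
    out = []
    for i in range(3):
        up, down = list(r), list(r)
        up[i] += h
        down[i] -= h
        out.append(-(hamiltonian(up, P, t, e, m, c)
                     - hamiltonian(down, P, t, e, m, c)) / (2 * h))
    return out


def dP_dt_expected(r, v, t, e, c, h=1e-5):
    """Lorentz force plus (e/c) [dA/dt at fixed r + (v.grad) A]."""
    grad_phi, J = [], []
    for i in range(3):
        up, down = list(r), list(r)
        up[i] += h
        down[i] -= h
        grad_phi.append((phi(up, t) - phi(down, t)) / (2 * h))
        A_up, A_down = vec_A(up, t), vec_A(down, t)
        J.append([(A_up[j] - A_down[j]) / (2 * h) for j in range(3)])  # dA_j/dr_i
    A_later, A_earlier = vec_A(r, t + h), vec_A(r, t - h)
    dA_dt = [(A_later[j] - A_earlier[j]) / (2 * h) for j in range(3)]
    E = [-grad_phi[i] - dA_dt[i] / c for i in range(3)]
    B = [J[1][2] - J[2][1], J[2][0] - J[0][2], J[0][1] - J[1][0]]  # curl A
    cross = [
        v[1] * B[2] - v[2] * B[1],
        v[2] * B[0] - v[0] * B[2],
        v[0] * B[1] - v[1] * B[0],
    ]
    out = []
    for i in range(3):
        force = e * (E[i] + cross[i] / c)
        total_dA = dA_dt[i] + sum(v[j] * J[j][i] for j in range(3))
        out.append(force + (e / c) * total_dA)
    return out
```

To use it, pick a velocity, form $\vec{P} = γ m \vec{v} + e \vec{A} / c$ at the chosen point, and compare the two functions component by component; they should agree to the accuracy of the finite differences.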

*The Hamilton-Jacobi Equation

In classical dynamics, there are three standard approaches to the equations of motion: the Lagrangian, the Hamiltonian, and the (somewhat less common) Hamilton-Jacobi equation.

This last approach is discussed in my Classical Mechanics lecture on the topic. Briefly, the action is regarded as a function of the endpoint parameters (including time) for fixed initial conditions, and the system follows the classical path for the given initial and final variables. The formalism is a classical analogue of the Schrödinger equation.

This approach is not discussed in Jackson, but Landau uses it to give an elegant analysis of charged particle motion in a Coulomb field, and also to understand how the geometric optics limit of wave equations corresponds to the classical mechanics limit.
