8. A New Way to Write the Action Integral

Michael Fowler

Introduction

Following Landau, we'll first find how the action integral responds to incremental changes in the endpoint coordinates and times, then use the result to write the action integral itself in a new, more intuitive way. This new formulation shows very directly the link to quantum mechanics, and variation of the action in this form gives Hamilton's equations immediately.

Function of Endpoint Position

We’ll now think of varying the action in a slightly different way. (Note: We’re using Landau's notation.) Previously, we considered the integral of the Lagrangian over all possible different paths from the initial place and time $q^{(1)}, t_{1}$ to the final place and time $q^{(2)}, t_{2}$ and found the path of minimum action. Now, though, we’ll start with that path, the actual physical path, and investigate the corresponding action as a function of the final endpoint variables, given a fixed beginning place and time.

Taking one degree of freedom (the generalization is straightforward), for a small path variation the incremental change in action

$δ S = {[\frac{\partial L}{\partial \dot{q}} δ q]}_{t_{1}}^{t_{2}} + \int_{t_{1}}^{t_{2}} (\frac{\partial L}{\partial q} - \frac{d}{d t} \frac{\partial L}{\partial \dot{q}}) δ q d t$

(Recall that first term comes from the calculus of variations when we allow the end point to vary—it’s exactly the same point we previously discussed in the brachistochrone problem of fastest time for a given horizontal distance, allowing the vertical position of the endpoint to be a free parameter.)

With the incremental variation, we’ve gone from the physical path $P$ (followed by the system in configuration space from $q^{(1)}, t_{1}$ to $q^{(2)}, t_{2}$ ) to a second path $P^{'}$ beginning at the same place and time, and ending at the same time $t_{2}$ as $P,$ but at a slightly different place $q^{(2)} + δ q (t_{2})$ .

Both paths $P, P^{'}$ are fully determined by their initial and final positions and times, so $P, P^{'}$ must correspond to slightly different initial velocities. The important point is that since both paths describe the natural dynamical development of the system from the initial conditions, the system obeys the equations of motion at all times along both paths, and therefore the integral term in the above equation is identically zero.

Writing $δ q (t_{2}) = δ q, p^{(2)} = p$ the action, regarded as a function of the final position variable, with the final time fixed at $t_{2}$ , has the differential

$δ S (q^{(2)}, t_{2}) = {[\frac{\partial L}{\partial \dot{q}} δ q]}_{t_{1}}^{t_{2}} = p^{(2)} δ q^{(2)} = p δ q .$

For the multidimensional case, the incremental change in the action on varying the final position variable is given by (dropping the superscript)

$\partial S / \partial q_{i} = p_{i},$

Function of Endpoint Time

What about the action as a function of the final point arrival time?

Since $S = \int_{t_{1}}^{t_{2}} L d t,$ the total time derivative $d S / d t_{2} = L (q^{(2)}, t_{2}),$ the value of the Lagrangian at the endpoint. Remember we are defining the action at a point as that from integrating along the true path up to that point.

Landau denotes $t_{2}$ by just $t$ , so he writes $d S / d t = L$ , and we’ll be doing this, but it’s crucial to keep in mind that the endpoint position and time are the variables here!

If we now allow an incremental time increase, $t_{2} \to t_{2} + d t$ , with the final coordinate position as a free parameter, the dynamical path will now continue on, to an incrementally different finishing point.

This will give (with $t$ understood from now on to mean $t_{2}$ , and $q_{i}$ means $q_{i}^{(2)}$ )

$\frac{d S (q_{i}, t)}{d t} = \frac{\partial S}{\partial t} + \sum_{i} \frac{\partial S}{\partial q_{i}} {\dot{q}}_{i} = \frac{\partial S}{\partial t} + \sum_{i} p_{i} {\dot{q}}_{i} .$

Putting this together with $d S / d t = L$ gives immediately the partial time derivative

$\partial S / \partial t = L - \sum_{i} p_{i} {\dot{q}}_{i} = - H$

and therefore, combining this with the result $\partial S / \partial q_{i} = p_{i}$ from the previous section,

$d S (q_{i}, t) = \sum_{i} p_{i} d q_{i} - H d t .$

This, then, is the total differential of the action as a function of the spatial and time coordinates of the end of the path.

Varying Both Ends

The argument given above for the incremental change in action from varying the endpoint is clearly equally valid for varying the beginning point of the integral (there will be a sign change, of course), so

$d S (q_{i}^{(2)}, t_{2}, q_{i}^{(1)}, t_{1}) = \sum_{i} p_{i}^{(2)} d q_{i}^{(2)} - H^{(2)} d t_{2} - \sum_{i} p_{i}^{(1)} d q_{i}^{(1)} + H^{(1)} d t_{1} .$

The initial and final coordinates and times specify the action and the time development of the system uniquely.

(Note: We’ll find this equation again in the section on canonical transformations—the action will be seen there to be the generating function of the time-development canonical transformation, this will become clear when we get to it.)

Another Way of Writing the Action Integral

Up to this point, we’ve always written the action as an integral of the Lagrangian with respect to time along the path,

$S (q_{i}^{(2)}, t_{2}, q_{i}^{(1)}, t_{1}) = \int_{q^{(1)}, t_{1}}^{q^{(2)}, t_{2}} L d t$ .

However, the expression derived in the last section for the increment of action generated by an incremental change in the path endpoint is clearly equally valid for the contribution to the action from some interior increment of the path, say from $(q, t)$ to $(q + d q, t + d t)$ , so we can write the total action integral as the sum of these increments:

$S (q_{i}, t) = \int d S = \int (\sum_{i} p_{i} d q_{i} - H d t) .$

In this integral, of course, the $d q_{i}$ add up to cover the whole path.

(In writing $(q_{i}, t)$ we’re following Landau’s default practice of taking the action as a function of the final endpoint coordinates and time, assuming the beginning point to be fixed. This is almost always fine $—$ we’ll make clear when it isn’t.)

How this Classical Action Relates to Phase in Quantum Mechanics

The link between classical and quantum mechanics is particularly evident in the expression for the action integral given above. In the so-called semi-classical regime of quantum mechanics, the de Broglie waves oscillate with wavelengths much smaller than typical sizes in the system. This means that locally it’s an adequate approximation to treat the Schrödinger wave function as a plane wave,

$ψ (x, t) = A (x, t) e^{i (k x - ω t)} = A (x, t) e^{(i / ℏ) (p x - E t)}$

where the amplitude function $A (x, t)$ only varies over distances much greater than the wavelength, and times far longer than the oscillation period. This expression is valid in almost all the classically accessible regions, invalid in the neighborhood of turning points, but the size of those neighborhoods goes to zero in the classical limit.

As we’ve discussed earlier, in the Dirac-Feynman formulation of quantum mechanics, to find the probability amplitude of a particle propagating from one point to another, we add contributions from all possible paths between the two points, each path contributing a term with phase equal to $i / ℏ$ times the action integral along the path.

From the semi-classical Schrödinger wave function above, it’s clear that the change in phase from a small change in the endpoint is $(i / ℏ) (p d x - E d t)$ , coinciding exactly with the incremental contribution to the action in $S = \int d S = \int (\sum_{i} p_{i} d q_{i} - H d t) .$

So again we see, here very directly, how the action along a classical path is a multiple of the quantum mechanical phase change along the path.

Hamilton’s Equations from Action Minimization

For arbitrary small path variations $δ q, δ p$ in phase space, the minimum action condition using the form of action given above generates Hamilton’s equations.

(Note for nitpickers: This may seem a bit surprising, since we generated this form of the action using the equations along the actual dynamical path, how can we vary it and still use them? Bear with me, you’ll see.)

We’ll prove this for a one dimensional system, it’s trivial to go to many variables, but it clutters up the equations.

For a small path deviation $δ q, δ p$ , the change in the action $S = \int (p d q - H d t)$ is

$δ S = \int [δ p d q + p d (δ q) - (\partial H / \partial q) δ q d t - (\partial H / \partial p) δ p d t] = 0$

and integrating $p d (δ q)$ by parts, with $δ p = δ q = 0$ at the endpoints,

$δ S = \int δ p \{d q - (\partial H / \partial p) d t\} + [p δ q] - \int δ q \{d p + (\partial H / \partial q) d t\} = 0.$

The path variations $δ p, δ q$ are independent and arbitrary, so must have identically zero coefficients $—$ Hamilton’s equations follow immediately, $\dot{q} = \partial H / \partial p,$ $\dot{p} = - \partial H / \partial q .$

Again, it’s worth emphasizing the close parallel with quantum mechanics: Hamilton’s equations written using Poisson brackets are:

$\dot{q} = [H, q], \dot{p} = [H, p] .$

In quantum mechanics, the corresponding Heisenberg equations of motion for position and momentum operators in terms of commutators are

$\dot{q} = (1 / i ℏ) [H, q], \dot{p} = (1 / i ℏ) [H, p] .$

How Can p, q Really Be Independent Variables?

It may seem a little odd at first that varying $p, q$ as independent variables leads to the same equations as the Lagrangian minimization, where we only varied $q,$ and that variation “locked in” the variation of $\dot{q} .$ And, isn’t $p$ defined in terms of $q, \dot{q}$ by $p = \partial L / \partial \dot{q},$ which is some function of $q, \dot{q}$ ? So wouldn’t varying $q$ automatically determine the variation of $p$ ?

The answer is, no, $p$ is not defined as $p = \partial L / \partial \dot{q}$ from the start in Hamilton’s formulation. In this Hamiltonian approach, $p, q$ really are taken as independent variables, then varying them to find the minimum path gives the equations of motion, including the relation between $p$ and $q, \dot{q}$ .

This comes about as follows: Along the minimum action path, we just established that

$d H (p, q) = \dot{q} d p - \dot{p} d q .$

We also have that $L = p \dot{q} - H,$ so (Legendre transformation!)

$d L (q, \dot{q}) = p d \dot{q} + \dot{p} d q,$

from which, along the physical path, $p = {(\partial L / \partial \dot{q})}_{q constant} .$ So this identity, previously written as the definition of $p,$ now arises as a consequence of the action minimization in phase space.

previous home next PDF