51 Causality and the Kramers-Krönig Relations

(Jackson 7.10)

Causality: Polarization as a Response to a Field

The frequency dependence of $ε (ω)$ reflects the time lag in the response of material polarization to an imposed field: recall $\vec{D} = ε \vec{E} = ε_{0} \vec{E} + \vec{P} .$ The $ε_{0}$ term indicates an instantaneous local response (really more of a change in notation than a physical transformation), but the contribution to $\vec{D}$ from the polarization $\vec{P}$ takes a finite time, because it is the response of a material to a force: even electronic oscillators take time to respond fully when a driving force is switched on.

This is not so evident in frequency space, where we write

$\vec{D} (\vec{x}, ω) = ε (ω) \vec{E} (\vec{x}, ω),$

but that equation describes the material response to a monochromatic field, therefore one that has been oscillating forever.

To analyze response to a field that's switched on in time, we need to go to the Fourier transforms, summing over frequencies to get the appropriate time dependence:

$\vec{D} (\vec{x}, t) = \frac{1}{2 π} \int_{- \infty}^{\infty} \vec{D} (\vec{x}, ω) e^{- i ω t} d ω, \vec{D} (\vec{x}, ω) = \int_{- \infty}^{\infty} \vec{D} (\vec{x}, t) e^{i ω t} d t,$

$\begin{matrix} \vec{D} (\vec{x}, t) = \frac{1}{2 π} \int_{- \infty}^{\infty} \vec{D} (\vec{x}, ω) e^{- i ω t} d ω = \frac{1}{2 π} \int_{- \infty}^{\infty} ε (ω) \vec{E} (\vec{x}, ω) e^{- i ω t} d ω \\ = \frac{1}{2 π} \int_{- \infty}^{\infty} ε (ω) \int_{- \infty}^{t} \vec{E} (\vec{x}, t^{'}) e^{i ω t^{'}} e^{- i ω t} d t^{'} d ω . \end{matrix}$

(Notation: We use the standard physics normalization of Fourier transforms, writing $d ω / 2 π$ and $d t .$ In this section (7.10) Jackson uses the mathematician’s convention, $d ω / \sqrt{2 π}, d t / \sqrt{2 π},$ even though he used the physics normalization earlier, for example in equations 6.33, 6.34.)

This is from Jackson, but we've made one slight change (to the upper limit of the $t^{'}$ integral). This section is headed causality, and the equation describes how the electric field at time $t^{'}$ at point $\vec{x}$ generates polarization that contributes to the field at the same point $\vec{x}$ at a later time $t .$ Think of it as a local damped oscillator at $\vec{x}$ that has been subject to a time-varying driving force $\vec{E} (\vec{x}, t^{'}),$ and we're now finding its phase and amplitude at time $t .$

Obviously, this can only depend on the driving force at earlier times, so we integrate $d t^{'}$ up to $t,$ not to infinity. It's convenient to introduce the time difference variable, $t - t^{'} = τ .$ This gives $d t^{'} = - d τ,$ but the range of integration goes from $(- \infty, t)$ to $(\infty, 0)$ which we then switch to $(0, \infty)$ giving a second minus sign.

Finally, writing $ε (ω) = ε_{0} (1 + χ_{e} (ω))$ (from lecture 29),

$\begin{matrix} \vec{D} (\vec{x}, t) = \frac{1}{2 π} \int_{- \infty}^{\infty} ε (ω) \int_{0}^{\infty} \vec{E} (\vec{x}, t - τ) e^{- i ω τ} d τ d ω \\ = \frac{ε_{0}}{2 π} \int_{- \infty}^{\infty} (1 + χ_{e} (ω)) \int_{0}^{\infty} \vec{E} (\vec{x}, t - τ) e^{- i ω τ} d τ d ω . \end{matrix}$

Actually, we should have treated the constant term (the 1 in $1 + χ_{e} (ω)$) separately from the start: it's the instantaneous vacuum response of $\vec{D}$ to $\vec{E} .$ By (unnecessarily) incorporating it in the integral we've made things slightly awkward: the $\int_{- \infty}^{\infty} \frac{d ω}{2 π} e^{- i ω τ}$ integral gives $δ (τ),$ and since the $τ$ integral ends at $τ = 0,$ we need the lower limit to be $τ = 0_{-}$ for this to make sense. Maybe this is why Jackson went to infinity, but I think doing that obscures the causal nature of the equation.

Having dealt with the first term in $\vec{D} = ε_{0} \vec{E} + \vec{P},$ let's focus on the less trivial second term.

We'll define the Green's function

$\begin{matrix} G (τ) = \frac{1}{2 π} \int_{- \infty}^{\infty} [(ε (ω) / ε_{0}) - 1] e^{- i ω τ} d ω, \\ = \frac{1}{2 π} \int_{- \infty}^{\infty} χ_{e} (ω) e^{- i ω τ} d ω \end{matrix}$

assuming with Jackson that the reversal of order of integrations is OK.

Note: We’re assuming throughout that the response at $\vec{x}$ does not depend on the earlier field at some different point ${\vec{x}}^{'} :$ that is, we’re assuming nonlocality in time, but locality in space. This works well for dielectrics, including in the visible range, but breaks down for metals if the mean free path of electrons between collisions exceeds the scale of field variation (for example, the skin depth). In practice, this only occurs with very pure metals at very low (helium) temperatures and very high frequencies (GHz). It’s called the anomalous skin effect.

Simple Model for $G (τ)$

To illustrate the technique, we'll assume initially that there is only one resonant frequency:

$\frac{ε (ω)}{ε_{0}} - 1 = \frac{ω_{p}^{2}}{ω_{0}^{2} - ω^{2} - i γ ω},$

$G (τ) = \frac{ω_{p}^{2}}{2 π} \int_{- \infty}^{\infty} \frac{e^{- i ω τ}}{ω_{0}^{2} - ω^{2} - i γ ω} d ω .$

(In fact, not a bad approximation in the region near that frequency.)

We can evaluate this by contour integration. Notice that for positive $τ,$ the $e^{- i ω τ}$ term diverges in the upper half plane, so we must complete the contour along the real axis with a large semicircle in the lower half plane.

For negative $τ,$ we must complete the contour with a semicircle in the upper half plane.

The only singularities in the integrand are the two poles at the roots of the equation $ω_{0}^{2} - ω^{2} - i γ ω = 0.$

They are poles in the lower half plane at $ω_{1, 2} = - \frac{1}{2} i γ \pm ν_{0}, ν_{0}^{2} = ω_{0}^{2} - \frac{1}{4} γ^{2} .$

Hence, for $τ > 0,$

$G (τ) = ω_{p}^{2} e^{- γ τ / 2} \frac{\sin ν_{0} τ}{ν_{0}} θ (τ),$

and (reassuringly!) for $τ < 0, G (τ) = 0.$
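As a numerical sanity check on this result, the Fourier integral for $G (τ)$ can be evaluated by brute force and compared with the closed form. This is only a sketch: the parameter values ($ω_{0} = 1, γ = 0.2, ω_{p} = 1,$ arbitrary units), the frequency cutoff, the grid spacing and the function names are all assumptions chosen for illustration, not taken from Jackson.

```python
import numpy as np

# Illustrative parameters in arbitrary units (assumed, not from the lecture).
OMEGA_0, GAMMA, OMEGA_P = 1.0, 0.2, 1.0

def chi(omega):
    """Single-oscillator susceptibility, eps(omega)/eps0 - 1."""
    return OMEGA_P**2 / (OMEGA_0**2 - omega**2 - 1j * GAMMA * omega)

def green_numeric(tau, omega_max=400.0, d_omega=0.005):
    """Riemann-sum estimate of (1/2pi) * integral chi(w) exp(-i w tau) dw.

    The integral is truncated at +/- omega_max; chi falls off like 1/w^2,
    so the neglected tails are small for tau not too close to zero.
    """
    n = int(round(2 * omega_max / d_omega)) + 1
    omega = np.linspace(-omega_max, omega_max, n)
    step = omega[1] - omega[0]
    integrand = chi(omega) * np.exp(-1j * omega * tau)
    return np.sum(integrand) * step / (2 * np.pi)

def green_closed(tau):
    """Contour-integration result: damped sinusoid for tau > 0, zero otherwise."""
    if tau <= 0:
        return 0.0
    nu0 = np.sqrt(OMEGA_0**2 - GAMMA**2 / 4)
    return OMEGA_P**2 * np.exp(-GAMMA * tau / 2) * np.sin(nu0 * tau) / nu0

for tau in (-2.0, -0.5, 0.5, 1.0, 3.0):
    # The numerical value is complex; its imaginary part should be negligible.
    print(tau, green_numeric(tau).real, green_closed(tau))
```

For negative $τ$ the numerical integral should come out close to zero, and for positive $τ$ it should follow the damped sinusoid, up to the small truncation error of the finite frequency range.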

The analyticity of $χ_{e} (ω)$ (the Fourier transform of $G (τ)$), and hence of $ε (ω),$ in the upper half complex plane is equivalent to causality.

(From quantum mechanics, the width of a spectral line ( $\sim γ$ ) is the inverse of the lifetime of the excited state, so the Green’s function decays in a time of order the lifetime of the state, of order microseconds to nanoseconds.)

Causality and Analyticity of $ε (ω)$

We have established that

$\vec{D} (\vec{x}, t) = ε_{0} [\vec{E} (\vec{x}, t) + \int_{0}^{\infty} G (τ) \vec{E} (\vec{x}, t - τ) d τ] .$

Now these are real values of fields at some instant in time, not frequency components that could have phase lags, so the Green's function $G (τ)$ is real.

Recalling its definition as the Fourier transform of $χ_{e} (ω),$

$G (τ) = \frac{1}{2 π} \int_{- \infty}^{\infty} [(ε (ω) / ε_{0}) - 1] e^{- i ω τ} d ω,$

we can Fourier transform back to find,

$\frac{ε (ω)}{ε_{0}} = 1 + \int_{0}^{\infty} G (τ) e^{i ω τ} d τ .$
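This inverse relation can be checked numerically for the single-oscillator model: integrating the closed-form $G (τ)$ against $e^{i ω τ}$ over $τ > 0$ should give back $ε (ω) / ε_{0} - 1.$ The sketch below assumes the same illustrative parameters ($ω_{0} = 1, γ = 0.2, ω_{p} = 1$) and uses hypothetical function names. It also evaluates the integral at a complex frequency in the upper half plane, where the extra factor $e^{- \mathrm{Im} (ω) τ}$ only improves convergence.

```python
import numpy as np

# Illustrative parameters in arbitrary units (assumed, not from the lecture).
OMEGA_0, GAMMA, OMEGA_P = 1.0, 0.2, 1.0

def chi(omega):
    """Single-oscillator susceptibility, eps(omega)/eps0 - 1 (omega may be complex)."""
    return OMEGA_P**2 / (OMEGA_0**2 - omega**2 - 1j * GAMMA * omega)

def green(tau):
    """Closed-form G(tau) for tau >= 0 (vectorized)."""
    nu0 = np.sqrt(OMEGA_0**2 - GAMMA**2 / 4)
    return OMEGA_P**2 * np.exp(-GAMMA * tau / 2) * np.sin(nu0 * tau) / nu0

def chi_from_green(omega, tau_max=200.0, d_tau=0.01):
    """Trapezoid estimate of the integral of G(tau) exp(i omega tau) over tau >= 0."""
    n = int(round(tau_max / d_tau)) + 1
    tau = np.linspace(0.0, tau_max, n)
    step = tau[1] - tau[0]
    f = green(tau) * np.exp(1j * omega * tau)
    return (np.sum(f) - 0.5 * (f[0] + f[-1])) * step

for omega in (0.5, 1.0, 2.0, 1.0 + 0.5j):
    print(omega, chi_from_green(omega), chi(omega))
```

Because $G (τ)$ vanishes for $τ < 0,$ the integral converges for any $ω$ with positive imaginary part, which is the analyticity argument in numerical form.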

Since $G (τ)$ is real, it follows that

$\frac{ε (- ω)}{ε_{0}} = \frac{ε^{*} (ω^{*})}{ε_{0}} .$

It is apparent from the formula for $ε (ω)$ in terms of $G (τ)$ that $ε (ω)$ is analytic in the upper half plane, and if we can assume $G (τ)$ is finite and goes to zero at infinite time, then this is also true down to the real axis.

(Note: This doesn't work for conductors. Recall $ε (ω) = ε_{b} + i (σ / ω) :$ there is a pole at the origin. To understand what this does to the Green's function, we must go back to $G (τ) = \frac{1}{2 π} \int_{- \infty}^{\infty} [(ε (ω) / ε_{0}) - 1] e^{- i ω τ} d ω .$ The integrand will now include a term $i σ / ω ε_{0},$ and the integral along the real line must pass around the pole on a small semicircle in the upper half plane; the relevant formula is $1 / (ω + i 0) = P / ω - i π δ (ω) .$ This pole gives a time-independent contribution $σ / ε_{0}$ to $G (τ)$ for $τ > 0 ;$ the other terms are from oscillators with finite damping, and die away at long times.)
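As a check on the conductor term in the note above, here is the worked step, using the standard result $P \int_{- \infty}^{\infty} \frac{e^{- i ω τ}}{ω} d ω = - i π \, \mathrm{sgn} (τ)$:

$\frac{1}{2 π} \int_{- \infty}^{\infty} \frac{i σ}{ε_{0}} \left[ \frac{P}{ω} - i π δ (ω) \right] e^{- i ω τ} d ω = \frac{σ}{2 ε_{0}} \mathrm{sgn} (τ) + \frac{σ}{2 ε_{0}} = \frac{σ}{ε_{0}} θ (τ),$

so the constant $σ / ε_{0}$ appears only for $τ > 0,$ consistent with causality.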

Kramers-Krönig Relations

Since $ε (ω) / ε_{0}$ is analytic in the upper half plane, we can use Cauchy's theorem to relate the real and imaginary parts. The real part $ε^{'} (ω) / ε_{0}$ is the square of the refractive index, and the imaginary part describes the absorption. Note that if a medium has a refractive index different from that of the vacuum (as all do!) then there must be nonzero absorption in some frequency range.

For any $z$ in the upper half plane, the only singularity in the integrand below is at $ω^{'} = z,$ a simple pole, so for a contour encircling the upper half plane,

$\frac{ε (z)}{ε_{0}} - 1 = \frac{1}{2 π i} \oint_{C} \frac{[(ε (ω^{'}) / ε_{0}) - 1]}{ω^{'} - z} d ω^{'} .$

Since we know $(ε (ω) / ε_{0}) - 1 \sim ω^{- 2}$ at infinity, we can neglect the contribution from the large semicircle, leaving

$\frac{ε (z)}{ε_{0}} = 1 + \frac{1}{2 π i} \int_{- \infty}^{\infty} \frac{[ε (ω^{'}) / ε_{0} - 1]}{ω^{'} - z} d ω^{'}$

We now take the point in the upper half plane to be infinitesimally above the real axis,

$\frac{ε (ω)}{ε_{0}} = 1 + \frac{1}{2 π i} \int_{- \infty}^{\infty} \frac{[ε (ω^{'}) / ε_{0} - 1]}{ω^{'} - ω - i δ} d ω^{'}$

and now use the identity

$\frac{1}{ω^{'} - ω - i δ} = \frac{P}{ω^{'} - ω} + i π δ (ω^{'} - ω)$

to find

$\begin{array}{l} ε^{'} (ω) / ε_{0} = 1 + \frac{1}{π} P \int_{- \infty}^{\infty} \frac{ε^{″} (ω^{'}) / ε_{0}}{ω^{'} - ω} d ω^{'}, \\ ε^{″} (ω) / ε_{0} = - \frac{1}{π} P \int_{- \infty}^{\infty} \frac{ε^{'} (ω^{'}) / ε_{0} - 1}{ω^{'} - ω} d ω^{'}, \end{array}$

Remember now that $ε (- ω) = ε^{*} (ω^{*}),$ so on the real axis $ε^{'} (ω)$ is even and $ε^{″} (ω)$ is odd, and we can put positive and negative frequencies together,

$\begin{array}{l} ε^{'} (ω) / ε_{0} = 1 + \frac{2}{π} P \int_{0}^{\infty} \frac{ω^{'} ε^{″} (ω^{'}) / ε_{0}}{{ω^{'}}^{2} - ω^{2}} d ω^{'}, \\ ε^{″} (ω) / ε_{0} = - \frac{2 ω}{π} P \int_{0}^{\infty} \frac{ε^{'} (ω^{'}) / ε_{0} - 1}{{ω^{'}}^{2} - ω^{2}} d ω^{'} . \end{array}$
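These relations can be tested numerically on the single-oscillator model, for which both parts of $ε (ω)$ are known in closed form. The sketch below uses the full-line version of the relations and handles the principal value with a midpoint grid placed symmetrically about the singular point, so that the divergent contributions cancel in pairs. The parameter values, the integration window, the grid spacing and the function names are illustrative assumptions.

```python
import numpy as np

# Illustrative parameters in arbitrary units (assumed, not from the lecture).
OMEGA_0, GAMMA, OMEGA_P = 1.0, 0.2, 1.0

def chi(omega):
    """Single-oscillator susceptibility, eps(omega)/eps0 - 1."""
    return OMEGA_P**2 / (OMEGA_0**2 - omega**2 - 1j * GAMMA * omega)

def principal_value(f, omega, half_width=200.0, h=1e-3):
    """Estimate P-integral of f(w') / (w' - omega) dw' over a window around omega.

    Grid points sit at omega + (k + 1/2) h, symmetric about the pole, so the
    1/(w' - omega) singularity cancels between each +/- pair of points.
    """
    n = int(round(half_width / h))
    offsets = (np.arange(-n, n) + 0.5) * h
    return np.sum(f(omega + offsets) / offsets) * h

def kk_real_part(omega):
    """Real part of chi reconstructed from its imaginary part."""
    return principal_value(lambda w: chi(w).imag, omega) / np.pi

def kk_imag_part(omega):
    """Imaginary part of chi reconstructed from its real part."""
    return -principal_value(lambda w: chi(w).real, omega) / np.pi

for omega in (0.5, 1.0, 1.5):
    print(omega, kk_real_part(omega), chi(omega).real,
          kk_imag_part(omega), chi(omega).imag)
```

The reconstructed and exact values should agree closely, which is a useful way to confirm the signs in the two relations.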


It's worth relating these equations to our model of the dielectric in terms of a set of oscillators. Think about the complex function $ε (z)$ in the neighborhood of one of the poles, where it's proportional to

$ε (z) \sim \frac{1}{z - z_{0}} = \frac{1}{ω - (ν_{0} - i \frac{1}{2} γ)} .$

How does this function vary on going along the real axis past the pole? There is a peak in the imaginary part, with width of order $γ$ and height of magnitude $2 / γ .$ The contribution from this pole to the real part of $ε (ω)$ changes sign on passing the pole, so its contributions from the two sides essentially cancel.
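To make the shape explicit, the pole term can be split into real and imaginary parts on the real axis (a short worked step, taking the residue to be 1 as in the expression above):

$\frac{1}{ω - ν_{0} + \frac{1}{2} i γ} = \frac{ω - ν_{0}}{(ω - ν_{0})^{2} + \frac{1}{4} γ^{2}} - i \frac{\frac{1}{2} γ}{(ω - ν_{0})^{2} + \frac{1}{4} γ^{2}} .$

The imaginary part is a Lorentzian of magnitude $2 / γ$ at $ω = ν_{0}$ with full width at half maximum $γ ;$ the real part is odd about $ω = ν_{0},$ with extrema $\pm 1 / γ$ at $ω = ν_{0} \pm \frac{1}{2} γ .$ The overall sign and scale in $ε (ω)$ depend on the residue at the pole, which is omitted here.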

Physical Significance of the Kramers-Krönig Equations

The function $ε^{'} (ω) / ε_{0}$ is just the square of the refractive index $n (ω) .$ The first equation tells us that the refractive index of a material for light at a given frequency depends entirely on the rate of absorption of radiation, suitably weighted, summed over all frequencies. The refractive index would be unity (no refraction) if the material didn't absorb at any frequency.

And, the reason we can write these equations at all is that $ε (ω)$ is analytic as a function of a complex variable in the upper half plane.

The analyticity follows directly from causality: $G (τ),$ the Fourier transform of $ε (ω) / ε_{0} - 1,$ measures the response of the medium to a field imposed at a given time, and can only be nonzero at later times.

But that's true for all physical processes, in particular the response of an atom, a nucleus or an elementary particle to an ingoing wave. And any of these systems can be modeled in terms of oscillators. The scattering of one elementary particle off another is described in terms of a scattering matrix: a matrix rather than a simple function because there are multiple outcomes possible. Each possible outcome corresponds to a scattering matrix element, just a function of energy (and possibly angular momentum), and these functions obey Kramers-Krönig equations. They have similar behavior to that shown above, and resonances correspond to excited states of the particle, that is, unstable heavier particles. For particle scattering, the analogue of the refractive index is the phase shift of the wave function, so measuring these phase shifts can indicate the presence of resonances, other particles, at different energies.

Sum Rules

We’ve already met a sum rule: recall in lecture 47 we generalized the permittivity from a single oscillator

$\frac{ε (ω)}{ε_{0}} = 1 + \frac{N e^{2}}{ε_{0} m (ω_{0}^{2} - ω^{2} - i γ ω)}$

to a collection of similar oscillators, to find

$\frac{ε (ω)}{ε_{0}} = 1 + \frac{N e^{2}}{ε_{0} m} \sum_{j} \frac{f_{j}}{ω_{j}^{2} - ω^{2} - i γ_{j} ω}$

where $f_{j}$ is the number of electrons in a molecule with parameters $ω_{j}, γ_{j},$ and now $N$ is the number of molecules in unit volume. Generally, these oscillations are lightly damped, $γ_{j}$ is small, so $ε (ω)$ has a very small imaginary part except very close to one of the $ω_{j}$ 's.

The oscillator strengths satisfy a sum rule, $\sum_{j} f_{j} = Z,$ the total number of electrons per molecule, so that $N Z = n_{e} .$ (This is derived from the knowledge that at high frequencies $\frac{ε_{pl} (ω)}{ε_{0}} ≅ 1 - \frac{ω_{pl}^{2}}{ω^{2}}, ω_{pl}^{2} = \frac{n_{e} e^{2}}{ε_{0} m}:$ all the electrons are essentially free.)

Suppose now we take the first of the Kramers-Krönig equations

$ε^{'} (ω) / ε_{0} = 1 + \frac{2}{π} P \int_{0}^{\infty} \frac{ω^{'} ε^{″} (ω^{'}) / ε_{0}}{{ω^{'}}^{2} - ω^{2}} d ω^{'}$

in the limit $ω \to \infty,$ to find ( $P$ is now irrelevant)

$1 - \frac{ω_{pl}^{2}}{ω^{2}} = 1 - \frac{2}{π ω^{2} ε_{0}} \int_{0}^{\infty} ω^{'} ε^{″} (ω^{'}) d ω^{'},$

$ω_{pl}^{2} = + \frac{2}{π ε_{0}} \int_{0}^{\infty} ω^{'} ε^{″} (ω^{'}) d ω^{'} .$
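A numerical illustration of this sum rule, as a sketch under stated assumptions: take a hypothetical two-oscillator model with weights $f_{j}$ and check that $\frac{2}{π} \int_{0}^{\infty} ω^{'} ε^{″} (ω^{'}) / ε_{0} \, d ω^{'}$ equals the prefactor $N e^{2} / ε_{0} m$ times $\sum_{j} f_{j},$ independent of the resonance frequencies and damping constants. The oscillator parameters, cutoff, grid spacing and names below are invented for the example.

```python
import numpy as np

# Prefactor N e^2 / (eps0 m), set to 1 in arbitrary units (assumed).
PREFACTOR = 1.0
# Hypothetical oscillators: (f_j, omega_j, gamma_j), chosen only for illustration.
OSCILLATORS = [(0.3, 1.0, 0.2), (0.7, 2.5, 0.4)]

def chi(omega):
    """Multi-oscillator susceptibility, eps(omega)/eps0 - 1."""
    return sum(PREFACTOR * f / (w0**2 - omega**2 - 1j * g * omega)
               for f, w0, g in OSCILLATORS)

def sum_rule_integral(omega_max=2000.0, d_omega=0.005):
    """Trapezoid estimate of (2/pi) * integral_0^inf w * Im chi(w) dw.

    The integrand falls off only like 1/w^2, so a large cutoff is needed;
    the neglected tail is of relative order gamma / omega_max.
    """
    n = int(round(omega_max / d_omega)) + 1
    omega = np.linspace(0.0, omega_max, n)
    step = omega[1] - omega[0]
    integrand = omega * chi(omega).imag
    total = (np.sum(integrand) - 0.5 * (integrand[0] + integrand[-1])) * step
    return 2.0 / np.pi * total

expected = PREFACTOR * sum(f for f, _, _ in OSCILLATORS)
print(sum_rule_integral(), expected)
```

Changing the $ω_{j}$ or $γ_{j}$ in the list should leave the integral essentially unchanged, while changing the $f_{j}$ rescales it in proportion to their sum.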

This means that the sum rule found for oscillators is still true for more general functions. Moreover, the reality of $G (τ)$ means these functions have the same reality/symmetry constraints, so the complex $ε (z)$ can be represented by a set of oscillators, infinite in number if necessary: for example, a cut in the complex plane is equivalent within that plane to an infinite number of infinitesimal poles.

Jackson presents a second sum rule, that the average value of $ε^{'} (ω) / ε_{0}$ over all frequencies is unity. Looking at the expression (and the graph) for a single oscillator, it is clearly true in that case. Adding many oscillators, with strengths obeying the sum rule, it is again the case. As argued above, any $ε (z)$ (with the required symmetry and reality conditions) can be approximated as a sum over oscillators, so the result follows.

Jackson 7.11: Arrival of a Signal After Propagation through a Dispersive Medium

This is an abbreviated version of the treatment in the Second Edition of Jackson. Jackson evidently thinks it’s not as important as he once did, so we’ll just mention it here. If you find it interesting, look in the Second Edition. The first point is that the signal cannot arrive faster than light in a vacuum: this follows from very general analyticity properties of the permittivity. Next comes a discussion of precursors, work done around 1914 by Sommerfeld and Brillouin. A signal having a sharp leading edge is sent into a dielectric, and subsequently detected. The first precursor (Sommerfeld) is a weak high frequency signal. This is perhaps not surprising: the sharp initial edge requires high frequency components, and presumably they will outrun the others? The second precursor, the Brillouin precursor, is more interesting. This is low frequency, and quite strong, and chirps. A proper mathematical treatment takes a substantial amount of work, and we just don’t have time here. However, this subject is of more than academic interest: googling Brillouin precursor reveals fairly extensive discussion of possible related problems in 5G communications.
