18. Electrostatics Using Spherical Coordinates: Spherical Harmonics

Introduction

There are many situations on electrostatics, starting even with a single point charge at the origin, where the $(x, y, z)$ coordinates are a poor choice for analyzing the field $—$ the potential depends only on radial distance $r,$ so obviously $r$ needs to be one of the coordinates. The other two must fix location on the constant $r$ spherical surface. The standard choice is of course latitude and longitude, except that in physics we take the latitude angle $θ$ to be zero at the north pole, increasing to $π / 2$ at the equator and $π$ at the south pole.

The methods developed below for dealing with electric (and electromagnetic) fields in this coordinate system are often the most natural way of analyzing charge and current distributions, and the related electromagnetic radiation emanating from a bounded set of such oscillating charges/currents. The same analysis works on all scales from atoms, through antennas and planetary phenomena, to galactic scales and above, where, admittedly, general relativistic effects will make the analysis more interesting (and we won’t cover that here).

Consider now the electric potential $φ$ of a point charge not at the origin, but on the $z -$ axis : in $(r, θ, ϕ)$ coordinates, it clearly does not depend on $ϕ .$ in fact, a wide variety of systems have this azimuthal (meaning $ϕ$ - independent) symmetry, and $(r, θ)$ are the natural coordinates for analyzing their electrostatic fields.

Expressions for the common differential operators in $(r, θ, ϕ)$ can be found in my notes here.

Spherical Coordinates

In $(r, θ, ϕ)$ coordinates, Laplace's equation becomes:

$\nabla^{2} φ = \frac{1}{r^{2}} \frac{\partial}{\partial r} r^{2} \frac{\partial φ}{\partial r} + \frac{1}{r^{2}} \{\frac{1}{\sin θ} \frac{\partial}{\partial θ} (\sin θ \frac{\partial φ}{\partial θ}) + \frac{1}{\sin^{2} θ} \frac{\partial^{2} φ}{\partial ϕ^{2}}\} = 0.$

As usual, we look for factorizable solutions (which, it later turns out, form a complete basis for all physically relevant solutions). That is, we take

$φ (r, θ, ϕ) = R (r) Θ (θ) Φ (ϕ)$ .

(Note: This is the standard notation, as in Wikipedia, Griffiths, etc. Jackson, however, writes $φ (r, θ, ϕ) = \frac{U (r)}{r} P (θ) Q (ϕ) .$ )

Making this substitution in the above equation, and then dividing by $R Θ Φ$ as usual (for instance, in quantum treatment of the hydrogen atom), gives a sum of three terms to be zero, the first term a function of $r$ only, the second of $θ$ only, the third of $ϕ .$ (If you do need a reminder of how this works, details can be found here.) These three terms must always add to zero: that is, for arbitrary values of $r, θ, ϕ .$ This can only happen if the three terms are in fact separately constant, otherwise we could just vary one of them to get a nonzero result. (The individual constants of course add to zero.)

The simplest term to deal with is the azimuthal variation

$\frac{1}{Φ} \frac{d^{2} Φ}{d ϕ^{2}} = constant .$

Since the potential is real and single valued, and $ϕ$ goes all the way around the spherical axis, we must have $Φ (ϕ)$ a sum of terms $A_{m} \sin m ϕ + B_{m} \cos m ϕ$ for integer values of $m .$

The next term (in the $\{\}$ brackets in the differential equation) gives that $Θ (θ)$ must satisfy

$\frac{1}{\sin θ} \frac{d}{d θ} (\sin θ \frac{d Θ}{d θ}) - \frac{m^{2} Θ}{\sin^{2} θ} = - l (l + 1) Θ$

where at this point we've used $\partial^{2} φ / \partial ϕ^{2} = - m^{2} φ .$ Although at this stage $l (l + 1)$ is an arbitrary constant, we write it this way using hindsight, because we’ll find the final equation, that for $R (r),$ becomes

$\frac{d}{d r} (r^{2} \frac{d R}{d r}) = l (l + 1) R,$

$R = A r^{l} + B r^{- (l + 1)}$ .

We'll assume the field at the origin does not vary as a non-integral power of $r .$

Non-integral behavior would only happen in certain special geometries, such as conical conducting surfaces. We encountered similar behavior in 2D for two conducting sheets at different potentials intersecting along a line with an arbitrary angle between them.

To study the equation for $Θ (θ)$ further, it's convenient to set $x = \cos θ$ , so the equation becomes

$(\frac{d}{d x} (1 - x^{2}) \frac{d}{d x} + l (l + 1) - \frac{m^{2}}{1 - x^{2}}) Θ (x) = 0.$

Axially Symmetric Case: Legendre’s Equation and Polynomials

In this section, we'll examine the axially symmetric case, $m = 0,$ the differential equation becomes

$(\frac{d}{d x} (1 - x^{2}) \frac{d}{d x} + l (l + 1)) Θ (x) = 0.$

This is Legendre’s equation, and can be written as an eigenvalue equation, switching here to the standard notation $P_{l} (x)$ :

$- (\frac{d}{d x} (1 - x^{2}) \frac{d}{d x}) P_{l} (x) = l (l + 1) P_{l} (x)$

on the interval $π \geq θ \geq 0,$ that is, $- 1 \leq x \leq 1$ .

This is different from our previous operators in that the operator itself has singular points at the endpoints of the interval, so some discussion is required to establish that a complete orthogonal set of eigenfunctions can in fact be found.

Notice first that for $l = 0,$ possible independent solutions are $P_{0} (x) = 1$ and $P_{0} (x) = \ln (1 + x) - \ln (1 - x) .$ This second solution, although mathematically ok, is singular at the north and south poles, which are not special points $—$ we could have set our coordinate system at any orientation. So from now on we'll drop singular solutions (for any value of $l$ ).

For $l \neq 0,$ if we try a power series solution $P_{l} (x) = \sum_{j = 0}^{\infty} a_{j} x^{j},$ we find

$a_{j + 2} = [\frac{j (j + 1) - l (l + 1)}{(j + 1) (j + 2)}] a_{j} .$

Evidently, the series terminates at $j = l .$ But what if $j$ is even and $l$ is odd? Then the series will not terminate, the coefficients will asymptotically be equal, and the series will diverge at $x = 1.$ This is not a physical solution of the original differential equation, so we conclude that if $l$ is even/odd, the series, a polynomial of degree $l,$ only includes even/odd powers of $x .$

The orthogonality of the different $P_{l}$ 's follows from the equation

$[l (l + 1) - m (m + 1)] \int_{- 1}^{1} P_{l} (x) P_{m} (x) d x = {[(P_{l} {P^{'}}_{m} - {P^{'}}_{l} P_{m}) (1 - x^{2})]}_{- 1}^{1} .$

Since we are taking only nonsingular solutions, the right-hand side is zero.

We Could Have Found These Polynomials Using Schmidt Orthogonalization...

We've just shown that the $P_{l} (x)$ are a set of orthogonal polynomials in $x$ on the interval $(- 1, 1) .$

But the standard Schmidt orthogonalization procedure applied to $1, x, x^{2}, \dots$ generates a unique set of orthogonal polynomials, so these have to be the $P_{l}$ 's , apart from normalization.

Warning! In contrast to orthogonal sets of functions we've previously encountered (such as Fourier series), the Legendre polynomials, by tradition, are not defined to be orthonormal: their normalization is defined by

$P_{l} (1) = 1.$

The reason for this normalization convention will soon become evident.

Here are plots of the first few $P_{l}$ 's (from Wikipedia).

From the Schmidt process plus normalization

$\begin{array}{l} P_{0} (x) = 1 \\ P_{1} (x) = x \\ P_{2} (x) = \frac{1}{2} (3 x^{2} - 1) \\ P_{3} (x) = \frac{1}{2} (5 x^{3} - 3 x) \end{array}$

Rodrigues' Formula

Rodrigues discovered a very neat formula for generating the polynomials without having to go up the Schmidt ladder:

$P_{l} (x) = \frac{1}{2^{l} l!} \frac{d^{l}}{d x^{l}} {(x^{2} - 1)}^{l} .$

It's straightforward to prove that $P_{l} (x)$ defined in this way is a solution of the differential equation, and that the different $P_{l}$ ’s are orthogonal to each other.

Exercise: do it!

Therefore, since they are polynomials of the appropriate order, and mutually orthogonal, they must be the same as those found by successive orthogonalization, apart from normalization. Rodrigues' prefactor ensures the standard normalization $P_{l} (1) = 1.$ With this requirement, the $P_{l}$ 's are not orthonormal, in fact (as can be proved from Rodrigues' formula)

$\int_{- 1}^{1} P_{l} (x) P_{m} (x) d x = \frac{2}{2 l + 1} δ_{l m} .$

They have alternating parity:

$P_{l} (- x) = {(- 1)}^{l} P_{l} (x) .$

The Potential from a Point Charge in Legendre Polynomials: the Generating Function

The general axially symmetric (meaning $m = 0$ ) solution of the Laplace equation, $\nabla^{2} φ (r, θ) = 0,$ is

$φ (r, θ) = \sum_{l = 0}^{\infty} (a_{l} r^{l} + b_{l} r^{- l - 1}) P_{l} (\cos θ) .$

As a preliminary exercise, we take a unit charge on the $z -$ axis at $\vec{R} = (0, 0, R) .$

Then the solution of $\nabla^{2} φ = 0$ in the region $|\vec{r}| < R$ must have the form (dropping the $1 / 4 π ε_{0}$ , we're just doing math here, and we don’t want to clutter it up) :

$φ (\vec{r}) = \frac{1}{|\vec{r} - \vec{R}|} = \frac{1}{\sqrt{r^{2} + R^{2} - 2 r R \cos θ}} = \sum_{l = 0}^{\infty} a_{l} r^{l} P_{l} (\cos θ) .$

(The $b_{l}$ ’s in the more general expansion given above have to all be zero, since $φ$ is not singular at the origin $—$ it’s just from the one charge, which is not at the origin.)

We can now find the $a_{l}$ ’s by setting $θ = 0$ and Taylor expanding:

$\frac{1}{|\vec{r} - \vec{R}|} = \frac{1}{r - R} = \frac{1}{R} (1 + (\frac{r}{R}) + {(\frac{r}{R})}^{2} + \dots)$

Therefore,

$\frac{1}{\sqrt{r^{2} + R^{2} - 2 r R \cos θ}} = \frac{1}{R} \sum_{l = 0}^{\infty} {(\frac{r}{R})}^{l} P_{l} (\cos θ), r < R .$

It's now evident why the normalization $P_{l} (1) = 1$ is a good choice!

The function on the left is called the generating function of the Legendre polynomials. (A generating function is a function of two variables: when it is expressed as a power series in one of the variables, the successive coefficients are the functions of the other variable we’re generating.)

An exactly similar argument, based on the nondivergent behavior of $φ (r, θ)$ at infinity, gives

$\frac{1}{\sqrt{r^{2} + R^{2} - 2 r R \cos θ}} = \frac{1}{r} \sum_{l = 0}^{\infty} {(\frac{R}{r})}^{l} P_{l} (\cos θ), r > R .$

You will often see these two formulas combined and written

$\frac{1}{|\vec{r} - \vec{R}|} = \frac{1}{\sqrt{r^{2} + R^{2} - 2 r R \cos θ}} = \frac{1}{r_{>}} \sum_{l = 0}^{\infty} {(\frac{r_{<}}{r_{>}})}^{l} P_{l} (\cos θ) .$

Using the Generating Function to Find Recursion Relations among Legendre Polynomials

In this section, to make the equations easier to read, we'll go back to the notation $\cos θ = x,$ and also take $R = 1$ (in the equation at the end of the last section). Then, writing the generating function as $G (x, r),$ we have

$G (x, r) = \frac{1}{\sqrt{1 - 2 x r + r^{2}}} = \sum_{l = 0}^{\infty} r^{l} P_{l} (x) .$

(Just repeating the equation for the case $r < 1$ in the previous section.)

We can find recursion relations among the polynomials easily by differentiating the generating function with respect to its variables, then matching coefficients in the power series.

First, differentiating $G (x, r)$ with respect to $r,$

$\frac{\partial G (x, r)}{\partial r} = \frac{x - r}{{(1 - 2 x r + r^{2})}^{3 / 2}},$

$(1 - 2 x r + r^{2}) \frac{\partial G (x, r)}{\partial r} = (x - r) G (x, r) .$

Now writing this in terms of the polynomials,

$(1 - 2 x r + r^{2}) \sum_{n = 0}^{\infty} l r^{l - 1} P_{l} (x) = (x - r) \sum_{l = 0}^{\infty} r^{l} P_{l} (x) .$

Bonnet's Formula

Comparing coefficients of $r^{l}$ on the two sides we find

$(l + 1) P_{l + 1} - 2 x l P_{l} + (l - 1) P_{l - 1} = x P_{l} - P_{l - 1},$

giving Bonnet's formula:

$(l + 1) P_{l + 1} - (2 l + 1) x P_{l} + l P_{l - 1} = 0.$

Notice that Bonnet’s formula enables a recursive method for successively constructing the polynomials: starting with $P_{0} = 1, P_{1} = x$ we can generate them all.

Bonnet's formula is also useful for finding $\int x P_{l} P_{m} d x,$ evidently only nonzero for $m = \pm l .$ (This is relevant for transitions between atomic angular momentum states induced by an external electric field.)

Finally, using Bonnet's formula it's easy to find the value of the (even) polynomials at the origin:

$P_{2 l} (0) = \frac{2 l - 1}{2 l} P_{2 l - 2} (0) = \frac{(2 l - 1) (2 l - 3) (2 l - 5) \dots 1}{2 l (2 l - 2) (2 l - 4) \dots 2} .$

Recursion Relations Including Derivatives

More recursion relations are generated by differentiating $G (x, r)$ with respect to $x$

$(1 - 2 x r + r^{2}) \frac{\partial G (x, r)}{\partial x} = r G (x, r),$

and equating the coefficient of $r^{l + 1}$ on the two sides, to find

${P^{'}}_{l + 1} - 2 x {P^{'}}_{l} + {P^{'}}_{l - 1} = P_{l} .$

Subtracting $(2 l + 1)$ times this equation from twice the differential of Bonnet's equation, $(l + 1) P_{l + 1} - (2 l + 1) x P_{l} + l P_{l - 1} = 0$ gives

${P^{'}}_{l + 1} - {P^{'}}_{l - 1} = (2 l + 1) P_{l} .$

Putting this with

${P^{'}}_{l + 1} + {P^{'}}_{l - 1} = P_{l} + 2 x {P^{'}}_{l},$

adding and subtracting give:

$\begin{array}{l} {P^{'}}_{l + 1} (x) = (l + 1) P_{l} (x) + x {P^{'}}_{l} (x), \\ {P^{'}}_{l - 1} (x) = - l P_{l} (x) + x {P^{'}}_{l} (x) . \end{array}$

We can actually use these recursion relations to prove that the polynomials do indeed satisfy Legendre's equation: here we go.

Taking the top equation of the two, but shift the index down by one, to get

${P^{'}}_{l} (x) = l P_{l - 1} (x) + x {P^{'}}_{l - 1} (x),$

Now add this to $x$ times the bottom equation,

$x {P^{'}}_{l - 1} (x) + {P^{'}}_{l} (x) = - x l P_{l} (x) + x^{2} {P^{'}}_{l} (x) + l P_{l - 1} (x) + x {P^{'}}_{l - 1} (x),$

$(1 - x^{2}) {P^{'}}_{l} (x) = - x l P_{l} (x) + l P_{l - 1} (x),$

Then take the derivative,

${((1 - x^{2}) {P^{'}}_{l} (x))}^{'} = - l P_{l} (x) - x l {P^{'}}_{l} (x) + l {P^{'}}_{l - 1} (x),$

and finally, using ${P^{'}}_{l - 1} (x) = - l P_{l} (x) + x {P^{'}}_{l} (x),$ we find

${((1 - x^{2}) {P^{'}}_{l} (x))}^{'} + l (l + 1) P_{l} (x) = 0,$

checking that the Legendre polynomials generated this way satisfy Legendre's equation $—$ which of course we knew all along, since we began with a generating function satisfying Laplace's equation in the relevant region, so expanding $\nabla^{2} φ$ in powers of $r,$ each term must satisfy it.

One final point: the formula

${P^{'}}_{l + 1} - {P^{'}}_{l - 1} = (2 l + 1) P_{l}$

can be used to evaluate

$\int_{0}^{1} P_{l} (x) d x,$

since we found the values of the polynomials at $x = 0$ above. (Notice the integral must be zero for an even polynomial, since it's orthogonal to $P_{0} = 1.$ )

General Spherical Harmonics

It’s time to move from azimuthal symmetry to harmonics depending on both $θ$ and $ϕ,$ necessary in describing the electric potential from more general charge distributions.

So we’re back to

$(\frac{d}{d x} (1 - x^{2}) \frac{d}{d x} + l (l + 1) - \frac{m^{2}}{1 - x^{2}}) Θ = 0$

with $l, m$ integers. It turns out that the solutions are

$P_{l}^{m} (x) = {(- 1)}^{m} {(1 - x^{2})}^{m / 2} \frac{d^{m}}{d x^{m}} P_{l} (x)$

and the spherical harmonics are defined as

$Y_{l}^{m} (θ, φ) = \sqrt{\frac{2 l + 1}{4 π} \frac{(l - m)!}{(l + m)!}} P_{l}^{|m|} (\cos θ) e^{i m ϕ}, - l \leq m \leq l .$

These are orthonormal (from the corresponding property of the Legendre polynomials)

$\int_{0}^{2 π} d ϕ \int_{0}^{π} \sin θ d θ Y_{l^{'}}^{m^{'}} * (θ, ϕ) Y_{l}^{m} (θ, ϕ) = δ_{l^{'} l} δ_{m^{'} m},$

and complete:

$\sum_{l = 0}^{\infty} \sum_{m = - l}^{l} Y_{l}^{m} * (θ^{'}, ϕ^{'}) Y_{l}^{m} (θ, ϕ) = δ (ϕ - ϕ^{'}) δ (\cos θ - \cos θ^{'}),$

nontrivial to prove, see for example Byron and Fuller for a proof.

The general solution of $\nabla^{2} φ = 0$ can be written as

$φ (r, θ, ϕ) = \sum_{l = 0}^{\infty} \sum_{m = - l}^{l} (a_{l m} r^{l} + b_{l m} r^{- l - 1}) Y_{l}^{m} (θ, ϕ) .$

It is sometimes useful to borrow notation from quantum mechanics, and write these as $Y_{l}^{m} (θ, ϕ) \equiv |l, m〉 .$ Localized states on the surface of the sphere, and the corresponding delta function, can be written

$〈θ^{'}, ϕ^{'}| θ, ϕ〉 = δ (\cos θ^{'} - \cos θ) δ (ϕ^{'} - ϕ)$

and

$Y_{l}^{m} (θ, ϕ) = 〈θ, ϕ |l, m〉 .$

(Note: the area measure on the unit sphere is $\sin θ d θ d ϕ = - d (\cos θ) d ϕ .$ this is why the delta function has the form above. To check the sign's right, remember $\cos θ$ decreases as a function of $θ$ in $(0, π)$ .)

The spherical harmonics are eigenstates of vibration of a sphere, think a perfectly spherical balloon (pictures here from Wikimedia.).

They are also the angular patterns of wave functions for atomic orbitals, these being eigenstates of angular momentum, represented in quantum mechanics by an operator equivalent to $\nabla^{2}$ acting on the surface of a sphere.

Addition Theorem

An important identity is the so-called addition theorem for spherical harmonics:

$P_{l} (\cos γ) = \frac{4 π}{2 l + 1} \sum_{m = - l}^{l} Y_{l}^{m} * (θ^{'}, ϕ^{'}) Y_{l}^{m} (θ, ϕ)$

where $\cos γ = \cos θ \cos θ^{'} + \sin θ \sin θ^{'} \cos (ϕ - ϕ^{'})$ .

In other words, $γ$ is the angle between the direction $(θ, ϕ)$ and the direction $(θ^{'}, ϕ^{'}),$ so $\cos γ$ is the dot product of two unit vectors of the form $(\sin θ \cos ϕ, \sin θ \sin ϕ, \cos θ) .$

Writing this as

$P_{l} (\cos γ) = \frac{4 π}{2 l + 1} \sum_{m = - l}^{l} 〈θ^{'}, ϕ^{'}| l, m〉 〈l, m |θ, ϕ〉,$

notice that $\sum_{m = - l}^{l} |l, m〉 〈l, m|$ is the identity operator within the eigenvalue $l$ subspace, hence clearly invariant under rotations, so we can conveniently re-orient to put $θ^{'} = 0,$ that is, along the $z$ axis. Now $Y_{l}^{m} (0, ϕ)$ is only nonzero for $m = 0,$ otherwise the $e^{i m ϕ}$ term would give a non-differentiable function along the axis. (Or, notice it has a factor ${(1 - x^{2})}^{|m| / 2}$ )

So, on the right-hand side, after re-orientation, only $m = 0$ contributes, and the normalization condition

$Y_{l 0} = \sqrt{\frac{2 l + 1}{4 π}} P_{l} (\cos θ)$

gives the result.

Factorizing a Potential

This proves very useful in “factorizing” the potential between two charges,

$\frac{1}{|\vec{r} - \vec{R}|} = \sum_{l = 0}^{\infty} \frac{r_{<}^{l}}{r_{>}^{l + 1}} P_{l} (\cos γ) = 4 π \sum_{l = 0}^{\infty} \frac{1}{2 l + 1} \frac{r_{<}^{l}}{r_{>}^{l + 1}} \sum_{m = - l}^{l} Y_{l}^{m} * (θ^{'}, ϕ^{'}) Y_{l}^{m} (θ, ϕ) .$

Important! As we shall soon see, the significance of this is that any charge distribution can be factored into monopole, dipole, quadrupole, etc., and this equation is telling us that the dipole component of a charge distribution generates a potential having dipole angular dependence, etc. As we’ll find later, this correspondence also holds between spherical components of oscillating charge distributions and components of the radiation emitted.

previous index next