4. Math Bootcamp

Introduction

Here we’ll first review basic vector calculus: grad, div, curl, Gauss’ theorem, Stokes’ theorem and their variants. This is essential material for graduate E&M! Another crucial tool for understanding these fields is the Fourier transform, which also introduces in a natural way Dirac’s indispensable delta function, discussed at the end of the lecture.

Vectors: Cartesian Coordinates, Kronecker Delta

In E&M (in contrast to, say, QM) our vectors are not in an abstract space, they’re in three dimensions.

The essential properties of a linear vector space are that addition and multiplication by a scalar (a number) are defined and give another vector in the space.

Our ordinary three-dimensional space is spanned by three orthonormal basis vectors, the unit vectors along the $x, y, z$ axes, often written $\hat{\vec{x}}, \hat{\vec{y}}, \hat{\vec{z}}$ .

For notational efficiency, it is often convenient to write these unit vectors as ${\vec{e}}_{1}, {\vec{e}}_{2}, {\vec{e}}_{3},$ and to express their orthonormality as

${\vec{e}}_{i} \cdot {\vec{e}}_{j} = δ_{i j},$

this $δ_{i j}$ is called the Kronecker delta: it’s equal to 1 if $i = j$ , zero if $i \neq j$ .

Dummy Suffixes

Any 3D vector is a sum over components (the basis vectors ${\vec{e}}_{i}$ are said to “span the space”):

$\vec{a} = a_{x} \hat{\vec{x}} + a_{y} \hat{\vec{y}} + a_{z} \hat{\vec{z}} \equiv a_{1} {\vec{e}}_{1} + a_{2} {\vec{e}}_{2} + a_{3} {\vec{e}}_{3} = a_{i} {\vec{e}}_{i},$

where we’ve introduced Einstein’s dummy suffix notation in that last step:

If a suffix appears twice in an expression, the sum over 1,2,3 is understood.

If the same suffix appears more than twice, you made a mistake: the expression is meaningless.

The position vector

$\vec{r} = (x, y, z) = (x_{1}, x_{2}, x_{3})$

becomes

$\vec{r} = x_{i} {\vec{e}}_{i} .$

(It is also often written $r_{i} {\vec{e}}_{i} .$ )

The magnitude of the vector is written

$|\vec{a}| = \sqrt{a_{1}^{2} + a_{2}^{2} + a_{3}^{2}} = \sqrt{a_{i} a_{i}} .$

(You may be aware that in general relativity, this would be written $\sqrt{a_{i} a^{i}},$ and summation is only allowed between up and down indices. This refinement is unnecessary in E&M in ordinary space, that is, not gravitationally warped space).

Rotating a Displacement Vector, Orthogonal Matrices

Consider the incremental displacement vector

$Δ \vec{x} = (Δ x_{1}, Δ x_{2}, Δ x_{3}) .$

We know how this transforms under a rotation: for example, a rotation of the displacement vector about the $z$ axis gives new components in the fixed frame of reference:

$(\begin{matrix} Δ {x^{'}}_{1} \\ Δ {x^{'}}_{2} \\ Δ {x^{'}}_{3} \end{matrix}) = (\begin{matrix} \cos θ & - \sin θ & 0 \\ \sin θ & \cos θ & 0 \\ 0 & 0 & 1 \end{matrix}) (\begin{matrix} Δ x_{1} \\ Δ x_{2} \\ Δ x_{3} \end{matrix})$ .

(One way to see this is to write $Δ \vec{x}$ as a sum of three vectors, each along an axis, then rotate them separately, and write down their new components, then add.)

However, confusingly, a more common transformation is to find the new components of a fixed vector in a new frame of reference, which is, say, the old frame of reference rotated by $θ$ about the $z$ axis. This is of course

$(\begin{matrix} Δ {x^{'}}_{1} \\ Δ {x^{'}}_{2} \\ Δ {x^{'}}_{3} \end{matrix}) = (\begin{matrix} \cos θ & \sin θ & 0 \\ - \sin θ & \cos θ & 0 \\ 0 & 0 & 1 \end{matrix}) (\begin{matrix} Δ x_{1} \\ Δ x_{2} \\ Δ x_{3} \end{matrix}) .$

(It's worth mentally checking for a small angle to be sure you've got this sign right.)

This is an orthogonal matrix: it transforms orthogonal vectors into orthogonal vectors, its transpose is its inverse, obvious from the form above.

Since a general rotation can be written as a succession of rotations around axes, $A^{- 1} = A^{T}$ for any rotation.
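As a quick numerical illustration of the two conventions above, here is a minimal sketch in plain Python. The function names (`rot_active_z`, `rot_passive_z`, and the small matrix helpers) are made up for this example. It checks that the frame-rotation matrix satisfies $R R^{T} = 1$ and that it undoes the rotation of the vector through the same angle.

```python
import math

def rot_active_z(theta):
    """Rotates a vector by theta about the z axis (components in the fixed frame)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_passive_z(theta):
    """Components of a fixed vector in a frame rotated by theta about the z axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def is_identity(A, tol=1e-12):
    return all(abs(A[i][j] - (1.0 if i == j else 0.0)) < tol
               for i in range(3) for j in range(3))

theta = 0.3  # arbitrary test angle
R = rot_passive_z(theta)
print(is_identity(matmul(R, transpose(R))))         # transpose is the inverse
print(is_identity(matmul(R, rot_active_z(theta))))  # passive matrix undoes the active one
```

A numerical check at one angle is of course not a proof, but it is a handy way to catch a sign error.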

Exercise: Prove that if $A^{- 1} = A^{T}, B^{- 1} = B^{T}$ then ${(A B)}^{- 1} = {(A B)}^{T} .$

We’ll write the general rotation matrix as

$Δ {x^{'}}_{i} = R_{i j} Δ x_{j} .$

Regarding the displacements as infinitesimals, we see that the transformation can be written

$d {x^{'}}_{i} = \frac{\partial {x^{'}}_{i}}{\partial x_{j}} d x_{j}$

and of course from the chain rule,

$\frac{\partial {x^{'}}_{i}}{\partial x_{j}} \frac{\partial x_{j}}{\partial {x^{'}}_{k}} = \frac{\partial {x^{'}}_{i}}{\partial {x^{'}}_{k}} = δ_{i k} .$

From this, we can deduce that for a rotational change of basis, the differential operator transforms as

$\frac{\partial}{\partial {x^{'}}_{i}} = \frac{\partial x_{j}}{\partial {x^{'}}_{i}} \frac{\partial}{\partial x_{j}} .$

We’ll return to this soon.

Definition of a Vector

A vector is more than just three numbers: it isn’t given just by specifying its components in one frame of reference, there has to be a prescription for finding its components in any frame of reference.

So we define a vector $\vec{a}$ as having components $(a_{1}, a_{2}, a_{3})$ in one frame, and we require that its components in a new frame be given by exactly the same linear transformation that gives the components of a displacement vector $Δ \vec{x}$ in the new frame in terms of the components $(Δ x_{1}, Δ x_{2}, Δ x_{3})$ in the original frame, that is, as discussed above, $Δ {x^{'}}_{i} = R_{i j} Δ x_{j} .$

Hence by definition, under rotation the components of a vector $\vec{a}$ transform by

${a^{'}}_{i} = R_{i j} a_{j} .$

Definition of a Tensor

For future reference, we'll mention here what are called Cartesian tensors. The vector has the single suffix $i,$ as in $a_{i},$ a tensor has two or more suffixes, hence 9, 27, etc. components in three dimensions.

Each component transforms exactly as does the single-suffix vector, meaning that under rotation, when for a vector ${a^{'}}_{i} = R_{i j} a_{j},$ for a second-rank (two suffixes) tensor

${T^{'}}_{i j} = R_{i k} R_{j l} T_{k l} .$

Each suffix transforms by the vector rule. We’ll soon come across important tensors.
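To see the two-suffix rule in action, here is a small sketch in plain Python. The helper names and the test vectors are invented for illustration. It builds the simplest second-rank tensor, the product $T_{k l} = a_{k} b_{l}$ of two vectors, and checks that transforming $T$ with two factors of $R$ gives the same result as transforming $\vec{a}$ and $\vec{b}$ separately and then multiplying.

```python
import math

def rot_z(theta):
    """Frame rotation by theta about the z axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def transform_vector(R, a):
    # a'_i = R_ij a_j
    return [sum(R[i][j] * a[j] for j in range(3)) for i in range(3)]

def transform_tensor(R, T):
    # T'_ij = R_ik R_jl T_kl
    return [[sum(R[i][k] * R[j][l] * T[k][l]
                 for k in range(3) for l in range(3))
             for j in range(3)] for i in range(3)]

a = [1.0, 2.0, 3.0]    # arbitrary test vectors
b = [-0.5, 0.4, 2.0]
R = rot_z(0.8)

T = [[a[k] * b[l] for l in range(3)] for k in range(3)]
T_rot = transform_tensor(R, T)
a_rot, b_rot = transform_vector(R, a), transform_vector(R, b)

max_diff = max(abs(T_rot[i][j] - a_rot[i] * b_rot[j])
               for i in range(3) for j in range(3))
print(max_diff)  # should be at rounding-error level
```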

Finally, it’s worth asking: what are the eigenvectors of a rotation matrix? Let’s look at the one above, for rotation about the $z$ axis. Obviously, the only vector unaffected is one in the $z$ direction! But it’s a $3 \times 3$ matrix, so two other vectors must be just multiplied by a number (the eigenvalue) and stay pointing in the same direction. To see how that can be, we have to solve the eigenvalue equation

$(\begin{matrix} \cos θ & \sin θ & 0 \\ - \sin θ & \cos θ & 0 \\ 0 & 0 & 1 \end{matrix}) (\begin{matrix} x_{1} \\ x_{2} \\ x_{3} \end{matrix}) = λ (\begin{matrix} x_{1} \\ x_{2} \\ x_{3} \end{matrix})$

from which $(1 - λ) (λ^{2} - 2 λ \cos θ + 1) = 0, λ = 1, e^{i θ}, e^{- i θ} .$

The $λ = 1$ result has eigenvector $(0, 0, 1)$ , just the axis of rotation; the others are $(1, i, 0), (1, - i, 0) .$ That's why we couldn't see them: they're in complex three-space.

Exercise: check explicitly that these are indeed eigenvectors.

In fact, these eigenvectors correspond to the states of a spin one particle in quantum mechanics, and they are also relevant in describing circularly polarized light, as we shall see.
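The eigenvalue claims can be spot-checked numerically. The sketch below uses plain Python with complex numbers; the angle and the helper names are arbitrary choices for illustration. It applies the rotation matrix to each proposed eigenvector and compares with the eigenvalue times the vector.

```python
import cmath
import math

theta = 0.6  # arbitrary test angle
c, s = math.cos(theta), math.sin(theta)
R = [[c, s, 0], [-s, c, 0], [0, 0, 1]]

def apply(M, v):
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]

def is_eigenpair(M, v, lam, tol=1e-12):
    """True if M v = lam v to within tol."""
    Mv = apply(M, v)
    return all(abs(Mv[i] - lam * v[i]) < tol for i in range(3))

pairs = [
    ([0, 0, 1], 1),
    ([1, 1j, 0], cmath.exp(1j * theta)),
    ([1, -1j, 0], cmath.exp(-1j * theta)),
]
results = [is_eigenpair(R, v, lam) for v, lam in pairs]
print(results)  # [True, True, True]
```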

Dot Product

The scalar or dot product of two vectors is

$\vec{a} \cdot \vec{b} = a_{i} b_{i},$

using the dummy suffix notation.

Note that the property $R^{- 1} = R^{T}$ guarantees that inner products, including vector length $|\vec{a}| = \sqrt{a_{i} a_{i}},$ will be preserved under transformation to another orthogonal base.

Check this: write ${a^{'}}_{i} = R_{i j} a_{j}, {b^{'}}_{i} = R_{i j} b_{j}$ and show ${a^{'}}_{i} {b^{'}}_{i} = a_{i} b_{i} .$

Evaluating ${(\vec{a} + \vec{b})}^{2}$ makes clear that the dot product $\vec{a} \cdot \vec{b} = a b \cos θ,$ where $θ$ is the angle between the vectors, also it’s evidently the component of one vector in the direction of the other, multiplied by that vector’s magnitude.

Cross Product, Levi-Civita Symbol, Triple Product

The vector or cross product of two vectors is

$\vec{a} \times \vec{b} = (a_{2} b_{3} - a_{3} b_{2}, a_{3} b_{1} - a_{1} b_{3}, a_{1} b_{2} - a_{2} b_{1}) .$

This cross product is written more succinctly using the Levi-Civita symbol, $ε_{i j k}$ .

Here the suffixes $i, j, k$ go over the values 1,2,3; and $ε_{123} = ε_{231} = ε_{312} = 1$ for the cyclic orderings, $ε_{132} = ε_{321} = ε_{213} = - 1$ for the reversed (anticyclic) orderings, and if any two suffixes are equal, $ε = 0$ .

Thus

${(\vec{a} \times \vec{b})}_{i} = ε_{i j k} a_{j} b_{k},$

( $j, k$ being summed-over dummy suffixes.)
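The suffix formula translates almost word for word into code. In this minimal Python sketch the indices run over 0, 1, 2 rather than 1, 2, 3, and the compact formula used for `levi_civita` is just one convenient way to generate the values $0, \pm 1$; both are choices made for this example.

```python
def levi_civita(i, j, k):
    """epsilon_ijk for indices 0, 1, 2 (standing for 1, 2, 3).

    The product (i-j)(j-k)(k-i) is +2 for cyclic orderings, -2 for
    anticyclic ones, and 0 if any two indices coincide.
    """
    return (i - j) * (j - k) * (k - i) // 2

def cross(a, b):
    # (a x b)_i = epsilon_ijk a_j b_k, summed over the dummy suffixes j, k
    return [sum(levi_civita(i, j, k) * a[j] * b[k]
                for j in range(3) for k in range(3))
            for i in range(3)]

a = [1, 2, 3]
b = [4, 5, 6]
print(cross(a, b))                  # [-3, 6, -3]
print(cross(b, a))                  # [3, -6, 3], the antisymmetry noted in the text
print(cross([1, 0, 0], [0, 1, 0]))  # [0, 0, 1], x cross y gives z
```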

This has a simple geometric interpretation: taking the two vectors to have a common end (say, the origin) see them as two adjacent sides of a parallelogram. The vector $\vec{a} \times \vec{b}$ has magnitude equal to the area of this parallelogram, and direction perpendicular to the plane.

The sign has to be fixed by convention: it's defined to be the direction of progress of a right-handed corkscrew, the handle being rotated from $\vec{a}$ to $\vec{b} .$

Note that $\vec{a} \times \vec{b} = - \vec{b} \times \vec{a} .$

The triple product $(\vec{a} \times \vec{b}) \cdot \vec{c}$ is a scalar with magnitude equal to the volume of the parallelepiped having the three vectors as edges with their (non-arrowhead) ends at the same corner. Think of it as base area times height: the dot product of $\vec{c}$ with $\vec{a} \times \vec{b}$ equals the base area $|\vec{a} \times \vec{b}|$ multiplied by the component of $\vec{c}$ perpendicular to the base area plane, and that’s the height of the parallelepiped.

An Important Identity

A triple product that often comes up in E&M is

$\vec{a} \times (\vec{b} \times \vec{c}) = \vec{b} (\vec{a} \cdot \vec{c}) - \vec{c} (\vec{a} \cdot \vec{b}) .$

This is easily proved using the important identity

$ε_{i j k} ε_{i l m} = δ_{j l} δ_{k m} - δ_{j m} δ_{k l} .$

This identity can be established by listing possible values: there aren't that many (remember the $ε$ 's are zero if two suffixes are the same). Do it!

It's also worth thinking about the vector identity geometrically: $\vec{a} \times (\vec{b} \times \vec{c})$ must be in the $\vec{b}, \vec{c}$ plane, since it's perpendicular to $\vec{b} \times \vec{c},$ so it's some linear combination of $\vec{b}, \vec{c},$ and it must be perpendicular to $\vec{a} .$
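Listing the possible values is exactly the kind of job a few lines of code can do. The following Python sketch (indices 0, 1, 2 stand for 1, 2, 3; the function names and test vectors are illustrative) runs through all 81 choices of $j, k, l, m$ for the $ε$ identity, and then checks the triple product formula on one set of integer vectors.

```python
from itertools import product

def eps(i, j, k):
    """Levi-Civita symbol for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2

def delta(i, j):
    """Kronecker delta."""
    return 1 if i == j else 0

# epsilon_ijk epsilon_ilm = delta_jl delta_km - delta_jm delta_kl, summed over i
identity_holds = all(
    sum(eps(i, j, k) * eps(i, l, m) for i in range(3))
    == delta(j, l) * delta(k, m) - delta(j, m) * delta(k, l)
    for j, k, l, m in product(range(3), repeat=4)
)
print(identity_holds)  # True

def cross(a, b):
    return [sum(eps(i, j, k) * a[j] * b[k] for j in range(3) for k in range(3))
            for i in range(3)]

def dot(a, b):
    return sum(a[i] * b[i] for i in range(3))

a, b, c = [1, 2, 3], [-2, 0, 5], [4, -1, 2]  # arbitrary integer test vectors
lhs = cross(a, cross(b, c))
rhs = [b[i] * dot(a, c) - c[i] * dot(a, b) for i in range(3)]
print(lhs == rhs)  # True
```

It is still worth doing the enumeration by hand once, to see why only a handful of index combinations give anything nonzero.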

What’s a Vector Field?

A vector field is a generalization of an ordinary function $f (\vec{r})$ .

An ordinary function assigns a number to each point in the space where it’s defined, for example a map of temperature over an area (or even a volume).

A vector field gives a vector for each point in some space, such as $\vec{E} (\vec{r}) = \frac{1}{4 π ε_{0}} \frac{q \hat{\vec{r}}}{r^{2}}, r \neq 0$ .

Exercise: sketch roughly the magnetic field for a small bar magnet. That’s a vector field: strictly, there’s an arrow at each point in space.

What we’ve called an ordinary function is often called a scalar function or scalar field, to contrast it with a vector field. The electrostatic potential in some volume is a scalar field.

The Gradient Operator

Definition

The gradient operator, called grad, is

$\vec{\nabla} = \hat{\vec{x}} \frac{\partial}{\partial x} + \hat{\vec{y}} \frac{\partial}{\partial y} + \hat{\vec{z}} \frac{\partial}{\partial z} = {\vec{e}}_{i} \frac{\partial}{\partial x_{i}} .$

To picture the gradient operator, we’ll look at a two-dimensional example: a contour map of a stretch of territory with mountains, etc.

The many lines on the map connecting points at the same height are called contours. Essentially, the map shows height as a function of latitude and longitude, we’ll use $(x, y)$ coordinates from some local origin, so the height is a scalar function $h (x, y)$ .

Then $\vec{\nabla} h (x, y)$ is a vector field, giving, in fact, the local gradient in the usual geographic sense, that is, rate of change of height with respect to change in horizontal position.

If you take a step on this terrain $Δ \vec{r} = (Δ x, Δ y)$ your change in altitude is $\vec{\nabla} h \cdot Δ \vec{r} .$
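Here is a minimal numerical sketch of that statement in Python. The terrain function `h` is a made-up smooth hill, and the gradient is estimated by central differences; both are assumptions of this example rather than anything from the lecture. For a small step, the predicted change $\vec{\nabla} h \cdot Δ \vec{r}$ agrees with the actual change in height up to terms of second order in the step.

```python
import math

def h(x, y):
    """Hypothetical terrain: a single smooth hill centred on the origin."""
    return math.exp(-(x**2 + 2 * y**2))

def grad_h(x, y, eps=1e-6):
    """Central-difference estimate of (dh/dx, dh/dy)."""
    return ((h(x + eps, y) - h(x - eps, y)) / (2 * eps),
            (h(x, y + eps) - h(x, y - eps)) / (2 * eps))

x0, y0 = 0.5, -0.3   # starting point
dx, dy = 1e-4, 2e-4  # a small step

gx, gy = grad_h(x0, y0)
predicted = gx * dx + gy * dy               # grad h . delta r
actual = h(x0 + dx, y0 + dy) - h(x0, y0)    # true change in altitude
print(predicted, actual)
```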

Exercise: Check from the transformation laws given above that $Δ \vec{r} \cdot \vec{\nabla} f$ is invariant under rotations, and interpret your result. Now prove that taking a walk and coming back, $\oint_{closed path} \vec{\nabla} h \cdot d \vec{r} = 0$ .

Evidently, $\vec{\nabla} h$ is a vector field in the two-dimensional space, and in fact it’s just the component of the gravitational field for motion confined to the surface (the other is cancelled by the normal force holding you to the surface), that is, $m g \vec{\nabla} h \cdot d \vec{r}$ is the work you do against gravity when you take the small step with horizontal components $d \vec{r}$ , this expression is the increase in your gravitational potential energy.

Equipotentials and the Gradient Operator

The lines of constant height on a geographical map, the contour lines mentioned above, are clearly gravitational equipotentials, it takes no work against gravity to walk around at constant height, your potential energy doesn’t change, so, as we've found, for a step $d \vec{r}$ along such a line we must have $\vec{\nabla} h \cdot d \vec{r} = 0$ .

That is to say, the vector field $\vec{\nabla} h$ is perpendicular to the equipotential lines everywhere: it points along the line of maximum slope, in the uphill direction (the opposite direction, $- \vec{\nabla} h,$ is the direction of steepest descent). Analogously, the direction of the electric field at a point in an electrostatic problem is always perpendicular to the equipotential surface (this is three dimensions) through that point.

Exercise: (We'll be doing this in more detail later, this is to check you get the idea.) Sketch equipotentials and field lines for a system of two equal magnitude charges: same charge, opposite charge. In particular, sketch the lines near a saddle point: the midpoint between two like charges.

The Divergence Operator: Definitions, Gauss’ Theorem, Visualization

Definition

The divergence (called div) is formally defined as the differential operator acting on a vector field:

$\vec{\nabla} \cdot \vec{V} = \frac{\partial V_{x}}{\partial x} + \frac{\partial V_{y}}{\partial y} + \frac{\partial V_{z}}{\partial z} = \frac{\partial V_{i}}{\partial x_{i}} .$

There’s another way to define divergence: take a tiny cubical box, sides $d x, d y, d z$ around some point $(x, y, z)$ then

$\vec{\nabla} \cdot \vec{V} (x, y, z) = \lim_{d x, d y, d z \to 0} \frac{\int_{surface of cube} \vec{V} \cdot d \vec{S}}{d x d y d z}$

Here $d \vec{S}$ is a vector representing an increment of surface area: its magnitude is the area, its direction is perpendicular to the surface, but we have to specify which way; for a closed surface the convention is almost always outwards.

Let’s check that the two definitions are equivalent: consider first the contribution to the surface integral from the two faces perpendicular to the $x$ axis, they have areas of magnitude $d y d z$ . In the limit of a small box, we can assume the variation of $\vec{V}$ on these faces in the $y, z$ directions can be neglected, but the variation between the two faces because they are separated by $d x$ is the only contribution to the integral, since the $d \vec{S}$ for the two faces are in opposite directions, $\pm \hat{\vec{x}}$ .

Hence the net contribution to the integral over the cube surface from those faces is $\frac{\partial V_{x}}{\partial x} d x$ multiplied by the area $d y d z$ , and the product of infinitesimals $d x d y d z$ cancels with the denominator (in the expression for $\vec{\nabla} \cdot \vec{V}$ above) leaving just $\partial V_{x} / \partial x$ . The two other pairs of faces make corresponding contributions, establishing the equivalence of the two definitions.
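The small-box argument can be mimicked numerically. In this Python sketch the vector field `V` is an arbitrary polynomial example chosen for illustration, and each face integral is approximated, as in the argument above, by the value of the normal component at the face center times the face area. The flux per unit volume is then compared with the derivative formula.

```python
def V(x, y, z):
    """Hypothetical smooth vector field, chosen only for illustration."""
    return (x * x * y, y * z, x * z * z)

def div_exact(x, y, z):
    """dVx/dx + dVy/dy + dVz/dz for the field above, worked out by hand."""
    return 2 * x * y + z + 2 * x * z

def div_from_flux(x, y, z, a=1e-3):
    """Outward flux through a cube of side a centred on (x, y, z), over its volume."""
    half = a / 2
    flux = 0.0
    # The two faces of each pair have opposite outward normals,
    # so only the difference of the normal component survives.
    flux += (V(x + half, y, z)[0] - V(x - half, y, z)[0]) * a * a  # faces normal to x
    flux += (V(x, y + half, z)[1] - V(x, y - half, z)[1]) * a * a  # faces normal to y
    flux += (V(x, y, z + half)[2] - V(x, y, z - half)[2]) * a * a  # faces normal to z
    return flux / a**3

point = (1.0, 2.0, 0.5)
print(div_from_flux(*point), div_exact(*point))  # the two should agree closely
```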

Gauss’ Theorem

Gauss’ Theorem, a. k. a. the Divergence Theorem, is:

$\int_{V} d^{3} r \vec{\nabla} \cdot \vec{F} = \int_{S} d \vec{S} \cdot \vec{F} .$

This easily follows from the “small box” definition of the divergence: divide the whole volume into tiny boxes, so the volume integral becomes a sum of surface integrals over the surfaces of all these boxes. But the integrals over the internal surfaces all cancel in pairs (each internal face is shared by two boxes with opposite outward normals), leaving just that over the bounding surface.

Visualizing Divergence: Velocity Field for an Incompressible Fluid

To picture what the divergence of a vector field can look like, it’s helpful to visualize the flow of an incompressible fluid, with the velocity vector field $\vec{v} (\vec{r}) .$ Let’s assume at the moment that it’s a steady flow, not varying in time. Obviously, if the fluid is incompressible, it can’t be piling up anywhere, and we’ll assume there are no bubbles, it fills all the space we’re considering. (We’re taking the density of the fluid to be $ρ = 1$ , so the velocity field is the same as mass flow; remember the fluid is incompressible.)

Then, from the small cube picture, as much is flowing into any space as is flowing out, $\vec{\nabla} \cdot \vec{v} = 0$ everywhere. How do we change that? By having a source of fluid.

Consider a large body of water, like a deep large pool, and slip a thin hosepipe in, so the end is deep in the water somewhere. Now pump in water at a steady rate. Once things have settled down, if you imagine any surface $S$ within the water that encloses the source, $\int_{S} \vec{v} \cdot d \vec{S}$ over that surface must equal the rate of supply of water, call it $q$ .

If the water is at rest apart from the motion caused by this supply, $\vec{v}$ will point outwards symmetrically from the source, and the surface integral, now taken over a sphere, will tell us that $\vec{v} = q \hat{\vec{r}} / 4 π r^{2}$ . Furthermore, $\vec{\nabla} \cdot \vec{v}$ is zero everywhere except at the source, where it’s evidently large. (We’re assuming the walls are far away and we are ignoring them.)

This is of course the pattern of the electric field from a localized charge distribution. We see that $\vec{\nabla} \cdot \vec{v}$ at the hose is the local rate of water supply, integrating over the finite volume of the hose nozzle gives the total supply. Similarly, the divergence of the electric field gives the local charge density: the first of Maxwell’s equations is

$\vec{\nabla} \cdot \vec{E} = \frac{ρ}{ε_{0}} .$

A negative charge, in our watery analogy, is like a hosepipe with water being pumped away: imagine the flow field for two hoses not far from each other, one supplying water, the other taking it away. This is like two opposite charges.

A truly point charge has its problems, mathematically and physically, which we’ll discuss later. What about a uniformly charged (throughout its volume) sphere? That has uniform nonzero divergence inside it, and it’s an easy exercise using the divergence theorem to find that the spherically symmetric electric field increases linearly from zero at the center, to the sphere’s surface, then falls off as the inverse square. (This is the same as the gravitational field inside and outside the Earth, apart from sign, and assuming a uniform-density Earth.)
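The uniformly charged sphere takes only a few lines to sketch in Python. The units here are an assumption made for convenience: total charge $Q = 1$, radius $R = 1$, and $k = 1 / 4 π ε_{0} = 1$; the function names are likewise illustrative. By symmetry, Gauss' theorem on a sphere of radius $r$ gives $E \cdot 4 π r^{2} = Q_{enc} / ε_{0}$, and the enclosed charge grows as $r^{3}$ inside the sphere.

```python
def enclosed_charge(r, Q=1.0, R=1.0):
    """Charge inside radius r for a uniformly charged sphere (total Q, radius R)."""
    return Q * (r / R) ** 3 if r < R else Q

def E_radial(r, Q=1.0, R=1.0, k=1.0):
    """Radial field from Gauss' theorem, with k standing for 1/(4 pi eps0)."""
    return k * enclosed_charge(r, Q, R) / r**2

for r in (0.25, 0.5, 1.0, 2.0, 4.0):
    print(r, E_radial(r))
```

Inside the sphere the printed values grow in proportion to $r$, and outside they fall by a factor of four each time $r$ doubles.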

Variations of Gauss’ Theorem

Consider the abstract vector field $\vec{F} (\vec{r}) = \vec{c} p (\vec{r})$ where $\vec{c}$ is a constant vector, and $p (\vec{r})$ a scalar function of position. Gauss' theorem for this field is

$\vec{c} \cdot \int_{S} p d \vec{S} = \vec{c} \cdot \int_{V} \vec{\nabla} p d^{3} r .$

Exercise: Think about $p (\vec{r})$ as the local pressure in a fluid at rest under gravity. Take some volume completely inside the fluid, and interpret the equation physically.

More variations: writing Gauss’ theorem as

$\int_{V} d^{3} r \partial_{i} F_{i} = \int_{S} d a n_{i} F_{i},$

where $n_{i}$ is the unit vector pointing outwards, we see that we can formally replace the $F_{i}$ by a tensor $F_{i j}$ or $F_{i j k},$ etc., to get

$\int_{V} d^{3} r \partial_{i} F_{i j} = \int_{S} d a n_{i} F_{i j},$

because this is just the divergence theorem repeated three times, for $j = 1, 2, 3.$

The Curl Operator: Definitions, Stokes’ Theorem, Visualizations

Definition

The curl is:

$\vec{\nabla} \times \vec{V} = \hat{\vec{x}} (\frac{\partial V_{z}}{\partial y} - \frac{\partial V_{y}}{\partial z}) + \hat{\vec{y}} (\frac{\partial V_{x}}{\partial z} - \frac{\partial V_{z}}{\partial x}) + \hat{\vec{z}} (\frac{\partial V_{y}}{\partial x} - \frac{\partial V_{x}}{\partial y}) .$

This rather cumbersome expression is more succinctly written using the Levi-Civita symbol, $ε_{i j k}$ introduced earlier. Recall, the suffixes $i, j, k$ go over the values 1,2,3; and $ε_{123} = ε_{231} = ε_{312} = 1$ for the cyclic orderings, $ε_{132} = ε_{321} = ε_{213} = - 1$ for the reversed (anticyclic) orderings, and if any two suffixes are equal, $ε = 0$ .

So switching from $x, y, z$ to 1,2,3,

${(\vec{\nabla} \times \vec{V})}_{i} = ε_{i j k} \frac{\partial}{\partial x_{j}} V_{k} .$

And, just as for the div, there’s an integral way to express this:

To find the component of the curl in a particular direction, we take a small square contour perpendicular to that direction (so its area $d \vec{S}$ is in that direction) and find the limit of a line integral of the vector field around that contour as the contour size goes to zero, then divide the result by the area of the square.

So for the $x$ component,

${(\vec{\nabla} \times \vec{V} (x, y, z))}_{x} = \lim_{d y, d z \to 0} \frac{\int_{perimeter of square} \vec{V} \cdot d \vec{l}}{d y d z}$

The argument is closely parallel to that for the divergence, now we take opposite sides together, they nearly cancel, etc.: left as an exercise for the reader.

Stokes’ Theorem

This definition leads easily to Stokes’ Theorem relating an integral over an open surface to an integral around its boundary:

$\int_{S} d \vec{S} \cdot \vec{\nabla} \times \vec{F} = \oint_{C} d \vec{l} \cdot \vec{F} .$

Now we divide the surface into many small squares, taking the sum of the line integrals around all of them, all the interior parts cancel in pairs, leaving the integral around the perimeter, the curve $C$ , the edge of the surface $S$ .

The Electrostatic Field Has Zero Curl

Evidently, since $\oint_{C} \vec{E} \cdot d \vec{l}$ is the change in potential on going around a closed circuit, zero, an electrostatic field from a point charge has zero curl everywhere, and therefore so does any other electrostatic field, since experimentally it’s found that electric fields just add linearly.

This is really equivalent to saying that any electrostatic field is the gradient of a single-valued potential,

$\vec{E} (\vec{r}) = - \vec{\nabla} φ (\vec{r}),$

it’s easy to see that ${(\vec{\nabla} \times \vec{E})}_{i} = - {(\vec{\nabla} \times \vec{\nabla} φ)}_{i} = - ε_{i j k} \frac{\partial^{2} φ}{\partial x_{j} \partial x_{k}} = 0$ from the antisymmetry $ε_{i j k} = - ε_{i k j}$ combined with the symmetry of the second derivative under interchange of $j$ and $k$ .

(Nitpicking note: We’ll soon see that zero curl only necessarily means the field is a gradient if the space is simply connected.)

Visualizing Curly Fields: A Whirlpool in Incompressible Fluid and its Magnetic Analog

From this picture of the curl as the limit of an integral around a small contour, divided by the area enclosed by the contour, can that be visualized as the flow field of some incompressible fluid?

(The motivation here is visualizing the magnetostatic (time-independent) field, for which Maxwell’s second equation, $\vec{\nabla} \cdot \vec{B} = 0,$ assures us a sourceless incompressible fluid scenario will be valid. Furthermore, for static fields we have $\vec{\nabla} \times \vec{B} = μ_{0} \vec{j}$ , $\vec{j}$ being the current density.)

Consider the velocity field from a whirlpool. We’ll just keep to two dimensions for simplicity, looking at the surface, say of an emptying bathtub over the drain, or a hurricane viewed from a satellite. Clearly now $\oint_{circle} \vec{v} \cdot d \vec{l} \neq 0$ for a circle centered at the origin, there's a whirlpool. In fact, for an incompressible inviscid fluid in steady rotational motion of this type, Kelvin proved that the integral

$\oint_{circle} \vec{v} \cdot d \vec{l} = K,$

a path-independent constant for any contour including the origin. $K$ is called the circulation. It follows that the integral is zero if the closed contour does not include the origin, and that means $\vec{\nabla} \times \vec{v} = 0$ for this flow except at the origin, where it is a two-dimensional delta function of strength $K .$

In fact, this velocity field is the same as the magnetic field from a steady current in a long straight wire: that goes down as $1 / r$ with distance from the wire.
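The path independence of the circulation is easy to check numerically. The Python sketch below assumes the whirlpool velocity has magnitude $K / 2 π r$ directed azimuthally, which is the form consistent with a circulation $K$ around the origin; the value of `K`, the circles chosen, and the function names are arbitrary. It integrates $\vec{v} \cdot d \vec{l}$ counterclockwise around three circles by the midpoint rule.

```python
import math

K = 2.0  # circulation, arbitrary illustrative value

def v(x, y):
    """Whirlpool velocity: magnitude K / (2 pi r), directed azimuthally."""
    r2 = x * x + y * y
    return (-K * y / (2 * math.pi * r2), K * x / (2 * math.pi * r2))

def circulation(cx, cy, radius, n=2000):
    """Line integral of v . dl counterclockwise around a circle (midpoint rule)."""
    total = 0.0
    dphi = 2 * math.pi / n
    for m in range(n):
        phi = (m + 0.5) * dphi
        x = cx + radius * math.cos(phi)
        y = cy + radius * math.sin(phi)
        vx, vy = v(x, y)
        # dl = radius * (-sin phi, cos phi) * dphi
        total += (-vx * math.sin(phi) + vy * math.cos(phi)) * radius * dphi
    return total

print(circulation(0.0, 0.0, 1.0))  # circle centred on the origin: close to K
print(circulation(0.3, 0.2, 1.0))  # off-centre, still enclosing the origin: close to K
print(circulation(3.0, 0.0, 1.0))  # origin outside the circle: close to zero
```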

Obviously, this field cannot be represented as the gradient of a single-valued potential: if we had a magnetic monopole, and allowed it to move on a circular track around the wire, it would circle around with more and more energy! Actually that scenario is essentially equivalent to one of Faraday’s first electric motors: (the one on the right in the illustration) he had a long vertical bar magnet, the north pole above the surface of a pool of mercury, the south pole far below, and a current-carrying wire suspended from a point vertically above the north pole, its end in the mercury. The wire rotated in the mercury.

Exercise: Where is the energy to keep the wire rotating coming from? And, what’s going on on the left side of the illustration?

Evidently, although $\vec{\nabla} \times \vec{B} = 0$ everywhere outside the wire, this doesn’t guarantee that $\vec{B} = \vec{\nabla} ψ,$ for some scalar potential, obviously it isn’t.

Contrasting Different Curly Vector Fields: The Whirlpool and The Rotating Solid Body

Let’s think a bit more about the curl of the velocity field of a whirlpool: all the velocity lines move in circles, it looks a bit curly. But to check, take a small contour between two adjacent circles, with radial ends, which won’t contribute to the integral. Around the curved sides, the speed goes as $1 / r$ , but the length of contour goes as $r$ , so the two contributions to the contour integral from the curved sides cancel, meaning this flow field has zero curl (except at the central point).

Now contrast this with the velocity field of a rotating solid body: taking the same little contour, the contributions from the curved sides, using $v = ω r$ , will be

$ω {(r + d r)}^{2} d θ - ω r^{2} d θ$

from which $\vec{\nabla} \times \vec{v} = 2 ω \hat{\vec{z}}$ , where $\hat{\vec{z}}$ is a unit vector along the axis of rotation: the velocity curl has uniform magnitude throughout the rotating body.

In fact, this uniform curl describes the magnetic field inside a thick wire carrying a current, provided the current density is uniform over the whole cross-section.

A useful way to see the difference between these two circulating fields (whirlpool and rotating body) is to visualize the motion of a small piece of paper floating on the fluid surface. The rotation of the piece of paper measures the local curl of the velocity field. For the rotating body, the piece of paper will obviously rotate at the same rate as the body. Going around the whirlpool, though, it won’t rotate at all, because the fluid adjacent to its inner edge is moving faster, and this rotates the paper relative to a fixed direction just enough to compensate for its rotation from the circular motion around the whirlpool center.
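The small-contour argument for the two flows can be written out as a short Python sketch. The values of `omega` and `K`, and the function names, are illustrative assumptions. The contour is bounded by arcs at radii $r$ and $r + d r$ subtending an angle $d θ$; for purely azimuthal flow only the arcs contribute, and the circulation is divided by the enclosed area.

```python
import math

def curl_z_from_contour(speed, r, dr=1e-4, dtheta=1e-4):
    """Circulation around a small contour between arcs at r and r + dr,
    divided by the enclosed area. `speed(r)` is the azimuthal speed."""
    circulation = speed(r + dr) * (r + dr) * dtheta - speed(r) * r * dtheta
    area = 0.5 * ((r + dr) ** 2 - r ** 2) * dtheta
    return circulation / area

omega = 1.5  # angular velocity of the solid body (illustrative)
K = 2.0      # whirlpool circulation (illustrative)

def solid_body(r):
    return omega * r

def whirlpool(r):
    return K / (2 * math.pi * r)

for r in (0.5, 1.0, 2.0):
    print(r, curl_z_from_contour(solid_body, r), curl_z_from_contour(whirlpool, r))
```

The solid-body column should come out at $2 ω$ at every radius, and the whirlpool column at zero to within rounding error.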

When Does Zero Curl Mean the Field is a Gradient?

Now $\vec{\nabla} \times \vec{\nabla} f (\vec{r}) = 0$ for any scalar function of position. Does this mean that for a vector field $\vec{V},$ if $\vec{\nabla} \times \vec{V} = 0,$ there must be some function $f (\vec{r})$ such that $\vec{V} = \vec{\nabla} f$ ? A possible proof is given by writing $f (\vec{r}) = \int_{0}^{\vec{r}} \vec{V} ({\vec{r}}^{'}) \cdot d {\vec{r}}^{'} .$ This looks independent of the path, because if you choose two different paths, the difference between them is, by Stokes’ theorem, $\int_{surface spanning paths} \vec{\nabla} \times \vec{V} \cdot d \vec{s}$ and we know $\vec{\nabla} \times \vec{V} = 0.$

But there’s a catch!

This only works in a simply connected space. Otherwise, you can’t put a surface spanning the two paths. For example, in an annular region, a whirlpool flow has zero curl, and in fact the velocity is proportional to $\vec{\nabla} θ$ (which has magnitude $1 / r$ ), but $θ$ is not single valued, so it’s not a potential in the usual sense.

Similarly, if $\vec{\nabla} \cdot \vec{V} = 0,$ does this mean there’s a vector potential function $\vec{A} (\vec{r})$ such that $\vec{V} = \vec{\nabla} \times \vec{A}$ ? Again, the answer is yes, but with topological reservations, as will be made clear below.

In practice, the important application is to the magnetic field: since $\vec{\nabla} \cdot \vec{B} = 0,$ we have $\vec{B} = \vec{\nabla} \times \vec{A} .$ Griffiths has a long proof that includes working through two exercises. In any case, the result is a special case of Helmholtz’ theorem, which we shall go through in detail soon.

An interesting generalization of the vector potential is to try to construct it for a magnetic monopole, known as a Dirac monopole, predicted by some field theories, but so far undetected. We’ll examine that in detail later.

The Laplacian

One final operator that’s ubiquitous in electrostatics is the Laplacian

$\vec{\nabla} \cdot \vec{\nabla} φ = \nabla^{2} φ = \frac{\partial^{2} φ}{\partial x^{2}} + \frac{\partial^{2} φ}{\partial y^{2}} + \frac{\partial^{2} φ}{\partial z^{2}} .$

Most of electrostatics, in fact, is solutions of $\nabla^{2} φ =$ something, with various boundary conditions. You'll be seeing it a lot.

Fourier Transforms and Dirac’s Delta Function

Definition of Fourier Transform

(Note: for Fourier Series, see the basic review in my quantum notes here.)

First, in one dimension: a “reasonably smooth” function $f (x)$ that goes to zero as $x \to \pm \infty$ can be expressed as a sum over plane waves,

$f (x) = \int_{- \infty}^{\infty} \frac{d k}{2 π} f_{k} e^{i k x}$

where

$f_{k} = \int_{- \infty}^{\infty} f (x) e^{- i k x} d x .$

(Of course, we have to show these equations are consistent! We will.)

In three dimensions, the plane wave expression is $e^{i (\vec{k} \cdot \vec{r} - ω t)}$ from which, dealing for the moment with space and time separately, the standard notation is

$f (\vec{r}) = \int_{- \infty}^{\infty} \frac{d^{3} k}{{(2 π)}^{3}} f_{\vec{k}} e^{i \vec{k} \cdot \vec{r}}, f (t) = \int_{- \infty}^{\infty} \frac{d ω}{2 π} f_{ω} e^{- i ω t}$

and the inverse transforms have the obvious sign changes (and no $2 π$ denominators).

Fourier Transforms of Differential Operators

E&M is full of differential equations, and Fourier transforms are an important tool in finding solutions, for the simple reason that differentiation just becomes multiplication! Let’s see how this works.

If the Fourier transform of $f (\vec{r})$ is $f_{\vec{k}},$ that of $\vec{\nabla} f$ is $i \vec{k} f_{\vec{k}} .$ Similarly, if the (three-dimensional) transform of $\vec{A} (\vec{r})$ is ${\vec{A}}_{\vec{k}},$ so $\vec{A} (\vec{r}) = \int_{- \infty}^{\infty} \frac{d^{3} k}{{(2 π)}^{3}} {\vec{A}}_{\vec{k}} e^{i \vec{k} \cdot \vec{r}},$ then that of $\vec{\nabla} \cdot \vec{A}$ is $i \vec{k} \cdot {\vec{A}}_{\vec{k}}$ and that of $\vec{\nabla} \times \vec{A}$ is $i \vec{k} \times {\vec{A}}_{\vec{k}} .$

(Important Exercise: Check these results by just differentiating. Of course, we're assuming the integrals still converge; in practice, this is rarely a problem.)
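A one-dimensional numerical version of this check is sketched below in Python. The test function (a Gaussian), the integration range, and the helper name `fourier_transform` are choices made for this example. It computes $f_{k}$ with the convention $f_{k} = \int f (x) e^{- i k x} d x$ used above, and compares the transform of $d f / d x$ with $i k f_{k}$.

```python
import cmath
import math

def fourier_transform(f, k, L=12.0, n=4000):
    """f_k = integral of f(x) exp(-i k x) dx, trapezoidal rule on [-L, L].

    Adequate only for functions that are negligible outside [-L, L].
    """
    dx = 2 * L / n
    total = 0.0 + 0.0j
    for m in range(n + 1):
        x = -L + m * dx
        weight = 0.5 if m in (0, n) else 1.0
        total += weight * f(x) * cmath.exp(-1j * k * x)
    return total * dx

def f(x):
    return math.exp(-x * x / 2)

def df(x):
    """Derivative of f, worked out by hand."""
    return -x * math.exp(-x * x / 2)

k = 1.3  # arbitrary wavenumber
lhs = fourier_transform(df, k)       # transform of the derivative
rhs = 1j * k * fourier_transform(f, k)
print(abs(lhs - rhs))                # should be at rounding-error level
```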

Consistency of the Fourier Transform Equations: Introducing the Delta Function

It’s now time to ask about the consistency of the first two equations:

$f (x) = \int_{- \infty}^{\infty} \frac{d k}{2 π} f_{k} e^{i k x}, f_{k} = \int_{- \infty}^{\infty} f (x) e^{- i k x} d x .$

Feeding the second into the first we have

$f (x) = \int_{- \infty}^{\infty} \frac{d k}{2 π} \int_{- \infty}^{\infty} f (x^{'}) e^{- i k x^{'}} d x^{'} e^{i k x} = \int_{- \infty}^{\infty} d x^{'} f (x^{'}) [\int_{- \infty}^{\infty} \frac{d k}{2 π} e^{i k (x - x^{'})}] .$

Remember this is a completely arbitrary function (well, differentiable) and ask how the first term can always equal the last term. It must be that the expression in the square brackets, which will clearly have to be expressed as the limit of some slightly better-defined sequence of integrals, picks out only the bit of $f (x^{'})$ at $x^{'} = x .$ How can that be? Let’s do the integral, putting in gentle cutoffs to make it well-defined:

$\lim_{ε \to 0} \int_{- \infty}^{\infty} \frac{d k}{2 π} e^{i k x - ε |k|} = \lim_{ε \to 0} [\int_{- \infty}^{0} \frac{d k}{2 π} e^{i k (x - i ε)} + \int_{0}^{\infty} \frac{d k}{2 π} e^{i k (x + i ε)}] .$

These are convergent integrals, and the result is called Dirac’s delta function,

$δ (x) = \lim_{ε \to 0} \frac{1}{2 π i} (\frac{1}{x - i ε} - \frac{1}{x + i ε}) = \lim_{ε \to 0} \frac{1}{π} \frac{ε}{x^{2} + ε^{2}} .$

For nonzero $ε,$ we see that this is a peaked function with height $1 / π ε$ and width $\sim ε$ and total area equal to one (an elementary integral). As $ε \to 0,$ it becomes infinitely localized. This limiting case is not a function in the traditional sense (meaning something that gives a definite value at each point in its domain), despite its name, but it has a well-defined meaning provided it is in an integrand. Mathematicians call it a distribution; it has all its weight at one point. It would be difficult to do physics without the delta function, and physicists don’t insist that it always be in an integrand, as you will find. For example, it’s convenient to refer to a point charge as a delta function charge distribution.
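To watch the peaked function pick out the value of $f$ at a point, here is a minimal Python sketch. The test function, the point `x0`, the grid spacing, and the cutoff `L` are all assumptions of this example. It integrates $f (x)$ against the finite-$ε$ form $\frac{1}{π} \frac{ε}{{(x - x_{0})}^{2} + ε^{2}}$ and compares the result with $f (x_{0})$ as $ε$ is reduced.

```python
import math

def lorentzian(x, eps):
    """Finite-eps form of the delta function: (1/pi) * eps / (x^2 + eps^2)."""
    return eps / (math.pi * (x * x + eps * eps))

def smeared_value(f, x0, eps, L=10.0):
    """Integral of f(x) * lorentzian(x - x0, eps) over [-L, L], midpoint rule.

    The grid spacing is tied to eps so that the narrow peak is resolved.
    """
    dx = eps / 50.0
    n = int(round(2 * L / dx))
    total = 0.0
    for m in range(n):
        x = -L + (m + 0.5) * dx
        total += f(x) * lorentzian(x - x0, eps)
    return total * dx

def f(x):
    return math.exp(-x * x)

x0 = 0.5
for eps in (0.1, 0.01):
    print(eps, smeared_value(f, x0, eps), f(x0))
```

The difference from $f (x_{0})$ shrinks roughly in proportion to $ε$ , reflecting the slowly decaying $1 / x^{2}$ tails of this particular representation.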

Fourier Transform of Spherically Symmetrical Functions, and Specifically 1/r.

For a spherically symmetric function $f (\vec{r}) = f (r),$ the Fourier transform

$f_{\vec{k}} = \int f (r) e^{- i \vec{k} \cdot \vec{r}} d^{3} x = 4 π \int_{0}^{\infty} f (r) \frac{\sin k r}{k r} r^{2} d r .$

Hence for $f (r) = 1 / r,$

$f_{\vec{k}} = 4 π \int_{0}^{\infty} \frac{1}{r} \frac{\sin k r}{k r} r^{2} d r = \frac{4 π}{k} \int_{0}^{\infty} \sin k r d r .$

Writing $\sin k r = \frac{1}{2 i} (e^{i (k + i ε) r} - e^{- i (k - i ε) r})$ to make the integral converge (then we’ll take the limit $ε \to 0$ ) ,

$f_{\vec{k}} = \frac{4 π}{k} \frac{1}{2 i} (- \frac{1}{i (k + i ε)} + \frac{1}{- i (k - i ε)}) = \frac{4 π}{k^{2}} .$
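The convergence factor can also be kept finite and tested numerically. With the damping $e^{- ε r}$ included, the radial integral has the standard closed form $\int_{0}^{\infty} e^{- ε r} \sin k r d r = k / (k^{2} + ε^{2}),$ which tends to $1 / k$ as $ε \to 0$ . The Python sketch below (step size, cutoff, and the values of `k` and `eps` are arbitrary choices for illustration) compares a trapezoidal estimate with that closed form.

```python
import math

def damped_sine_integral(k, eps, dr=0.01):
    """Trapezoidal estimate of the integral of exp(-eps r) sin(k r) from 0 to R,
    with R = 40 / eps so that exp(-eps R) is negligible."""
    R = 40.0 / eps
    n = int(round(R / dr))
    total = 0.0
    for m in range(n + 1):
        r = m * dr
        weight = 0.5 if m in (0, n) else 1.0
        total += weight * math.exp(-eps * r) * math.sin(k * r)
    return total * dr

k, eps = 2.0, 0.05
numeric = damped_sine_integral(k, eps)
closed_form = k / (k * k + eps * eps)
print(numeric, closed_form, 1 / k)

# Corresponding transform of exp(-eps r)/r, next to the eps -> 0 limit 4 pi / k^2
print(4 * math.pi / k * numeric, 4 * math.pi / k**2)
```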

To check consistency, going the other way, we replace $d^{3} x$ with $d^{3} k / {(2 π)}^{3}$ :

$f (r) = \frac{4 π}{{(2 π)}^{3}} \int_{0}^{\infty} f_{k} \frac{\sin k r}{k r} k^{2} d k = \frac{1}{2 π^{2}} \int_{0}^{\infty} \frac{4 π}{k^{2}} \frac{\sin k r}{k r} k^{2} d k = \frac{2}{π r} \int_{0}^{\infty} \frac{\sin k r}{k} d k .$

Now in that last integral write $k r = x$ and find $I = \int_{0}^{\infty} \frac{\sin x}{x} d x,$ independent of $r$ . To evaluate this integral, note that the integrand is even, so $I$ is half of the imaginary part of $\int_{- \infty}^{\infty} \frac{e^{i z}}{z} d z$ taken along the real axis. It’s OK to close in the upper half plane, the only contributor is the pole at the origin (which lies on the contour, so it contributes $i π,$ half of $2 π i$ times the residue), so $I = π / 2, f (r) = \frac{1}{r} .$

Similarly, if $f (r) = \frac{1}{r^{2}},$ then $f_{k} = \frac{2 π^{2}}{k} .$ You’ll need these results in this course.

Note: if you're not familiar with integrals in the complex plane, check my quantum notes here, and perhaps also the preceding quantum lecture.
